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A  bstract 


This  document  describes  the  design  of  a  knowledge-based  system  for  providing 
consultational  advice  on  medical  diagnosis  and  therapy  selection.  A  rich  set  of  diag¬ 
nostic  concepts  is  incorporated  into  the  decision  making  processes,  which  are 
designed  to  resemble  the  reasoning  styles  of  clinicians  and  allow  them  to  examine 
the  decision  steps.  Medical  knowledge  is  represented  with  the  Procedural  Semantic 
Network  (PSN)  formalism  in  a  highly  uniform  fashion.  The  control  structure  is  based 
on  a  formal  model  of  inexact  reasoning  and  two  basic  paradigms  of  Artificial  Intelli¬ 
gence:  hypothesize-and-test,  and  competition  and  co-operation  among  local 
hypotheses.  Some  human  engineering  issues,  aimed  at  enhancing  user  acceptability, 
are  also  addressed  in  the  design.  The  simplicity  and  uniformity  of  the  data  and  con¬ 
trol  structures  allow  an  explanation  system  and  a  knowledge  acquisition  module  to 
be  implemented  easily.  A  prototype  of  the  system  has  been  implemented  and  lim¬ 
ited  testing  shows  some  desirable  features  of  the  system  behaviour. 
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Chapter  1 


Introduction 


1,1.  Introduction 

In  a  monograph  entitled  The  coming  revolution  in  medicine  [Rutstein67]  David 
Rutstein  discussed  the  paradox  of  modern  medicine  —  the  seeming  contradiction  of 
the  mushrooming  medical  research  program  and  the  slowdown  in  the  improvement 
of  the  national  health.  In  recognition  of  the  serious  problems  of  geographic  maldis¬ 
tribution  of  physicians  and  increasing  demand  of  quality  medical  care,  he  antici¬ 
pated  that  the  time  of  physicians  will  be  the  scarcest  and  most  valuable  commodity 
in  medicine  of  the  future.  Diagnosis  is  one  of  the  tasks  that  does  demand  a  great 
deal  of  a  physician’s  time  and  his  competence  can  be  utilized  more  effectively  by 
him  with  the  aid  of  analytical  tools.  Specialties  in  medicine  have  been  subdividing 
and  narrowing,  leading  to  an  explosion  of  medical  knowledge  and  information  that 
cannot  possibly  be  comprehended  by  an  individual  and  incorporated  into  the  deci¬ 
sion  making  processes.  In  this  respect  the  physician  can  take  advantage  of  the 
speed  of  a  computer,  the  reliability  and  capacity  of  its  memory  bank,  and  its  preci¬ 
sion  in  the  matching  of  patterns. 

From  an  academic  point  of  view,  research  in  computer-aided  medical  diagnosis 
helps  to  advance  the  techniques  of  artificial  intelligence  (AI),  especially  in  heuristic 
programming,  knowledge  representation  and  management.  Attempts  at  modeling 
the  reasoning  processes  of  expert  doctors  also  contribute  to  cognitive  studies  on 
human  memory  and  hypothesis  confirmation  in  general.  It  is  a  fascinating  and  edu¬ 
cational  process  for  experts  to  reflect  on  the  approach  that  they  have  always  used 
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when  providing  clinical  consultations.  In  trying  to  state  explicitly  and  precisely  the 
steps  involved  in  clinical  decision  making  for  the  purpose  of  automation,  they  may 
improve  the  teaching  of  their  skills  to  medical  students,  who  usually  learn  about 
decision  making  in  an  unstructured  way,  largely  through  observation.  Attempts  at 
encoding  medical  knowledge  into  computer  programs  may  also  improve  our  under¬ 
standing  of  the  structure  of  this  knowledge,  with  the  associated  development  of 
techniques  for  identifying  its  inadequacies  and  inconsistencies. 


1.2.  The  task 

The  fundamental  task  of  the  system  is  to  provide  consultation  advice  on  the 
diagnosis  and  treatment  of  cardiovascular  diseases.  The  input  data  consist  of 
findings  from  the  history,  the  physical  examination  and  the  laboratory  tests  of  a 
cardiology  patient.  Making  use  of  a  store  of  a  priori  medical  knowledge,  the  system 
produces  the  disease  or  set  of  diseases  that  best  accounts  for  the  given  findings. 
Basing  on  the  final  diagnosis  and  the  clinical  condition  of  the  patient  it  then  recom¬ 
mends  appropriate  therapeutic  interventions.  In  simple  terms,  the  system  is  merely 
a  pattern-matching  program  that  compares  the  stereotypical  manifestations  of  vari¬ 
ous  diseases  with  the  set  of  input  findings  and  looks  for  the  best  fit.  However,  there 
are  several  important  constraints  on  its  design  imposed  by  the  demands  for  high 
performance,  which  will  be  discussed  in  the  remainder  of  this  section. 

An  important  constraint  is  that  the  program  should  permit  the  diagnosis  (or 
diagnoses)  to  be  reached  in  a  sequential  fashion.  That  is,  it  should  not  be  necessary 
for  the  user  to  provide  all  the  necessary  data  at  the  outset.  Instead,  following  the 
same  procedure  that  a  physician  will  take,  the  system  should  be  able  to  suggest  the 
laboratory  or  physical  tests  that  are  most  valuable,  with  respect  to  their  diagnostic 
values  and  relative  costs,  at  each  point  in  the  decision  process  in  an  attempt  to 
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minimize  the  time  and  expense  in  arriving  at  a  diagnosis. 


It  will  also  be  more  natural  for  the  user  to  input  data  in  groups,  such  as  all 
findings  from  a  blood  test,  than  to  do  so  one  at  a  time  strictly  in  response  to  the 
system’s  requests.  This  will  allow  evidence  to  be  considered  as  soon  as  it  is  avail¬ 
able  and  help  to  reduce  the  duration  of  each  session.  One  last  point  about  data 
entry  is  that  facilities  should  be  provided  to  relieve  the  physician  of  the  task  of  rou¬ 
tine  interpretation  of  laboratory  data,  so  that  instead  of  inputting,  say,  "white  blood 
cell  count  too  high"  he  may  enter  the  actual  count.  The  system  should  be  able  to 
detect  abnormalities  from  the  data  and  activate  the  appropriate  hypotheses 
directly. 

The  goal  of  the  system  is  not  to  fully  take  over  the  diagnostic  duties  from  the 
physicians  (this  would  entail  dealing  with  some  entangled  ethical  issues  and  would 
meet  resistance  from  the  physicians),  but  rather  to  serve  as  a  tool  for  them.  As 
such  it  should  allow  different  levels  of  interaction  with  the  user,  possibly  depending 
on  his  expertise  and  his  familarity  with  the  case  at  hand.  For  instance,  he  may 
want  to  take  over  the  program  control  every  so  often  to  focus  the  system’s  atten¬ 
tion  on  what  he  thinks  is  more  likely  or  important,  or  let  it  run  on  its  own  only  to 
check  its  justification  for  the  final  diagnosis  at  the  end.  The  system  may  also  simply 
be  used  for  information  retrieval  for  referential  purposes.  A  compliant  control 
scheme  will  be  needed  to  allow  such  flexible  and  hopefully  effective  interaction 
between  the  user  and  the  system  in  arriving  at  a  diagnosis.  This  system  is  part  of 
an  ongoing  project  which  attempts  to  automate  various  diagnostic  procedures  in 
cardiology.  Other  systems  are  used  for  interpreting  data  from  different  laboratory 
techniques,  such  as  electrocardiography  (ECG)  [ShibaharaBO]  and  angiocardiography 
[Tsotsos80].  They,  in  place  of  the  physician,  will  therefore  function  as  direct  sources 
of  findings  from  their  corresponding  lab  procedures,  which  may  consist  of  not  only 
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indications  of  the  abnormalities  in  the  test  results  but  also  hypotheses  about  the 
possible  causes  of  the  disorders.  Consideration  should  be  given  in  the  design  of  the 
system's  interface  to  the  accommodation  of  such  subsystems.  Communication  in 
both  directions,  in  the  form  of  hypotheses  and  predictions,  should  be  feasible. 

We  would  like  the  system  to  be  competent  at  tackling  real-life  cases,  in  which 
the  evidence  often  does  not  point  in  a  consistent  direction.  Criteria  for  test  results 
may  not  be  clear-cut  and  symptoms  may  not  be  pronounced  enough  to  allow  une¬ 
quivocal  decisions.  The  system  should  therefore  be  able  to  accommodate  such  fuzzi¬ 
ness  in  the  data.  The  situation  may  further  be  complicated  by  the  co-existence  of 
severed  diseases  in  a  patient.  The  findings  must  be  carefully  sorted  out  and  attri¬ 
buted  to  the  possible  causes.  The  system  will  require  effective  attention-focusing 
mechanisms  for  minimizing  the  number  of  hypotheses  being  actively  considered  at 
any  one  time.  Information  on  the  interrelations  among  diseases  will  be  needed  to 
sort  out  the  major  problems  from  the  minor  ones  and  construct  a  coherent  diag¬ 
nosis. 


1.3.  Design  considerations 

There  are  numerous  issues  that  one  has  to  address  when  designing  a  system  for 
medical  computing  [Shortliffe79]  [Cowey80].  Designers  of  computer  programs  for 
giving  advice  to  physicians,  who  can  be  described  as  casual  and  critical  users,  must 
put  in  extra  effort  to  conform  to  their  requirements.  The  importance  of  the  human 
engineering  aspects  of  such  programs  cannot  be  over-emphasized.  They  must  not 
impose  any  unnatural  or  rigid  communication  conventions  on  the  users.  As  a  tool  to 
the  physician,  they  cannot  simply  state  their  inferences  as  dogma  without  justifying 
them.  Developers  of  the  MYCIN  system  (section  1.4)  hold  the  view  that  it  is  a  mis- 
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take  to  concentrate  research  primarily  on  methods  for  improving  the  computer’s 
decision  making  performance  when  clinical  impact  depends  on  solving  other  prob¬ 
lems  of  acceptance  as  well.  In  the  face  of  the  current  lack  of  acceptance  of 
computer-aided  diagnosis  by  the  medical  profession,  they  compiled  a  list  of  prere¬ 
quisites  for  the  acceptance  of  a  clinical  consultation  program  [ShortlifIe76]  as  the 
objectives  of  their  system.  The  system  must  be: 

(l)  useful; 

(£)  educational  when  appropriate; 

(3)  able  to  explain  its  advice; 

(4)  able  to  understand  and  respond  to  simple  questions  stated  in  natural  language; 

(5)  able  to  acquire  new  knowledge,  either  through  experience  or  by  being  told; 

(6)  easily  modifiable. 

These  considerations  are  aimed  at  satisfying  those  principles  that  reflect  the 
system’s  responsibility  to  the  physician  and,  through  him,  to  the  patient.  They  have 
also  served  as  useful  guides  for  the  design  of  our  system. 

Since  medical  knowledge  is  constantly  being  enriched  and  updated  over  time, 
any  systems  that  make  use  of  such  knowledge  must  allow  modifications  to  be  done 
very  easily.  A  knowledge  acquisition  capability  that  helps  the  user  to  pin  down  the 
problem  whenever  errors  in  its  decisions  are  found  will  permit  the  system  to  con¬ 
tinue  to  be  improved.  It  is  therefore  essential  to  provide  the  program  with  a 
mechanism  for  explanation  of  its  decision  steps  and  justification  of  its  final  diag¬ 
nosis.  This  will  also  have  potential  for  educational  purposes.  For  a  system  designed 
to  explain  its  decisions  and  educate  the  user,  there  should  be  a  nonrestrictive 
mechanism  by  which  the  user  can  communicate  with  the  program.  Natural  language 
would  be  ideal  for  the  user  to  express  his  requests.  Providing  for  such  facilities, 
though  in  a  limited  sense,  has  been  attempted  and  has  strongly  influenced  our 
design  of  both  the  knowledge  organization  and  control  structure. 
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1.4.  Background 


In  this  section  we  give  a  brief  account  of  some  of  the  prominent  methodologies 
applied  in  computer-aided  medical  diagnosis.  They  will  later  be  discussed  and  com¬ 
pared. 

Purely  mathematical  techniques  usually  involve  simple  algorithms,  and  can  be 
easily  and  efficiently  encoded  (in  FORTRAN,  say).  They  were  used  in  the  early  diag¬ 
nosis  programs  and  are  still  being  applied  to  various  domains.  However,  drawbacks 
of  such  techniques  are  quickly  realized  and  A1  approaches  have  been  the  focus  of 
recent  research.  We  shall  discuss  the  methodologies  under  these  two  categories  in 
the  following  subsections. 


1.4.1.  Mathematical  approach 

There  are  a  very  limited  number  of  medical  problem  areas  in  which  the  patho¬ 
physiologic  processes  are  so  well  understood  that  they  can  be  characterized  by 
mathematical  formulae,  based  on  a  small  number  of  clinical  parameters.  Programs 
have  been  written  to  assist  the  physician  with  such  problems,  for  example  the 
classification  and  management  of  electrolyte  and  acid-base  disorders  [Bleich72],  the 
evaluation  of  the  respiratory  status  and  formulation  of  therapeutic  recommendations 
for  patients  in  a  respiratory  care  unit  [Menn?3],  and  the  selection  of  a  digoxin  regi¬ 
men  based  on  a  pharmacokinetic  model  of  the  body’s  absorption,  metabolism  and 
excretion  [Peck73]. 

A  slightly  more  general  approach  involves  defining  the  mathematical  relation¬ 
ship  between  a  set  of  measurable  features  \xj]  and  classification  of  disease 
categories  Di  with  pattern-recognition  techniques  [Patrick?7].  In  order  to  find  the 
diagnostic  pattern  or  discrimination  function  of  the  form: 
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**iPi\x)  =  E 

y=i 

the  method  requires  a  training  set  of  cases  to  determine  the  coefficients,  according 
to  a  certain  training  criterion  (e.g.  least  squared  error).  Based  on  the  idea  of 
entropy  minimization,  Kulikowski  developed  a  sequential  formulation  which  was  used 
to  recognize  hyperthyroidism  [Kulikowski70]. 

A  more  flexible  scheme  can  be  obtained  by  making  use  of  separate  statistical 
information  about  the  findings  and  thus  allowing  them  to  be  considered  individually. 
Here  Bayes*  Theorem  provides  a  simple  mechanism  to  calculate  the  probability  of  a 
disease,  in  the  light  of  some  specified  evidence,  from  the  a  priori  probability  of  the 
disease  and  the  conditional  probabilities  relating  the  observations,  E  =  \Ej\,  to  the 
diseases  in  which  they  may  occur: 

p(W*,)= 

Y,P(Ei\E&Dj)P(Dj\E ) 

;  =  i 

This  method  has  some  appeal  since  it  is  potentially  exact,  and  most  work  by  the 
medical  researchers  has  been  done  along  this  approach  [Toronto63]  [Gorry88] 
[Wagner?8].  However,  it  suffers  severely  from  the  problem  of  requiring  a  large 
amount  of  data,  which  is  handled  by  researchers  typically  by  making  a  series  of 
assumptions.  Two  assumptions  usually  made  are  the  exhaustiveness  and  mutual 
exclusion  of  the  disease  categories  and  the  statistical  independence  among  the  pos¬ 
sible  observations.  These  assumptions  are  often  violated  and  techniques  are 
developed  to  improve  the  approximations,  such  as  aggregation  of  dependent  attri¬ 
butes  [Flehinger75]  [Norusis75a,  75b]. 

Other  numerical  models  such  as  discriminant  analysis  [OverallSl],  information 
theory  [Gleser72]  [Wortman72]  and  game  theory  [Dombal59]  [Hockstra76]  have  also 
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been  explored.  However,  they  also  tend  to  be  used  successfully  only  in  small,  well- 
constrained  problem  domains,  because  of  the  large  data  requirements  and  the  dis¬ 
tortions  these  purely  mathematical  schemes  imposed  on  the  problem.  Although  the 
notion  of  likelihood  obviously  plays  a  role  in  the  clinician's  thinking,  it  is  clear  that 
there  is  an  essential  logic  to  the  diagnostic  process,  i.e.  categorical  reasoning  [Szo- 
lovits78],  that  should  be  incorporated  into  any  computer  system  that  is  to  provide 
general  medical  consultation  capability.  Many  researchers  have  turned  to  AI 
problem-solving  techniques  to  find  appropriate  ways  to  combine  these  forms  of  rea¬ 
soning. 


1.4.2.  Artificial  intelligence  approaches 

These  programs  are  perhaps  best  characterized  by  symbolic  processing  and  the 
use  of  heuristic  problem-solving  techniques.  Medical  information  is  represented 
symbolically,  in  a  structure  generally  termed  the  knowledge  base,  and  has  progres¬ 
sively  become  less  dependent  on  pure  observational  data.  Owing  to  the  lack  of  pri¬ 
mary  data  available  for  decision  making,  subjective  judgemental  knowledge  which 
reflects  the  experience  of  an  expert  is  used.  In  addition,  there  is  high-level  stra¬ 
tegic  knowledge  revolving  around  a  specific  physiological  model  of  the  problem 
domain  or  a  cognitive  model  of  human  reasoning  processes,  or  a  combination  of 
each.  Incorporation  of  such  meta-level  knowledge  heightens  the  acceptability  of  the 
system  to  the  users  by  making  the  system  appear  more  aware  of  its  reasoning 
behaviour  and  capable  of  providing  explanations.  It  must  be  emphasized  that  AI 
does  not  necessarily  offer  a  means  for  avoiding  numerical  representation  of  data 
relationships,  but  does  suggest  new  and  potentially  powerful  methods  for  analyzing 
the  problem  domain  and  selecting  relevant  knowledge. 
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Here  we  shall  not  attempt  an  exhaustive  review  of  these  systems  but  briefly 
describe  some  of  the  major  approaches,  emphasizing  those  on  which  the  design  of 
our  system  is  based.  Some  of  their  shortcomings  and  future  directions  will  also  be 
discussed.  There  have  been  several  comprehensive  survey  papers  on  some  prom¬ 
inent  projects  in  this  field  [Pople75a]  [Szolovits78]  [Wagner78]  [Shortlifie79]  and  in 
knowledge  engineering  in  general  [Feigenbaum77]  [Waterman77]. 

MYCIN 

This  system  was  developed  at  Stanford  University  for  providing  consultation  on 
the  choice  of  infectious  disease  therapy  [Shortliffe78j  [Davis77aj.  Knowledge  for  the 
diagnosis  and  therapy  selection  is  represented  as  "packets''  in  the  form  of  produc¬ 
tion  rules,  conditional  statements  which  relate  observations  to  associated  inferences 
that  may  be  drawn.  Typical  examples  are  shown  in  Figure  1.4. 2-1. 

Starting  with  the  goal  of  establishing  the  diagnosis  of  a  significant  infection,  the 
system  uses  a  modiLs  ponens  (backward-chaining)  depth-first  deduction  scheme  to 
trigger  the  applicable  rules  —  the  condition  of  a  rule  is  asserted  by  user-supplied 
data  or  derived  from  other  rules.  Associated  with  each  rule  is  a  measure  of  eviden¬ 
tial  strength,  termed  a  certainty  factor,  and  with  each  fact  there  are  separate 
measures  of  evidence.  These  parameters  together  form  the  basis  for  the  system’s 
inexact  reasoning  scheme.  The  confirmation  formalism  defined  for  computing  the 
certainty  of  facts  provides  a  simple  yet  intuitively  appealing  scheme  for  handling 
uncertainty  of  the  hypotheses  and  fuzziness  in  the  data.  It  is  adopted  in  our  system 
with  slight  modifications  and  will  be  discussed  in  detail  in  chapter  3. 

As  mentioned  before,  user  acceptability  has  been  a  major  objective  of  the 
MYCIN  system  design.  The  developers  attribute  its  high  level  of  competence  to  the 
use  of  modular,  stylized  coding,  readily  translatable  to  simple  English,  as  a 
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knowledge  representation.  This  allows  the  knowledge  base  to  be  examined  and 
modified  easily.  The  relatively  uncomplicated  control  structure  is  also  made  possi¬ 
ble  by  the  lack  of  interactions  among  rules.  Since  rules  are  retrieved  because  they 
arc  relevant  to  a  specific  goal  (i.e.,  they  mention  that  goal  in  their  action  part),  a 
straightforward  explanation  system  can  be  implemented.  Adequate  question¬ 
answering  capability  can  be  achieved  through  the  retrieval  of  relevant  rules,  guided 
by  pre-established  lists  which  indicate  premise  and  action  contents,  or  by  specified 
events  (questions  and  conclusions)  happenning  during  the  consultation. 

There  have  been  several  successful  applications  of  MYCIN’s  data  and  control 
structures  with  rules  applicable  to  a  variety  of  problem  areas,  such  as  fault  diag¬ 
nosis  and  repair  recommendations  for  bugs  in  an  automobile  born  system,  industrial 
assembly  problems  and  the  engineering  domain  of  structural  analysis  [Bennett79]. 
However,  research  in  other  areas  showed  that  the  production  system  is  not  suitable 
for  some  domains,  especially  where  numerous  complexly  related  parameters  are 
involved.  It  is  somewhat  harder  to  express  a  given  piece  of  knowledge  if  it  must  be 
put  into  a  rather  restricted  form.  The  requirement  of  modularity  also  means  all 
necessary  contextual  information  must  be  stated  explicitly  in  the  premise,  which  at 
times  becomes  awkwardly  long  and  complicated.  In  implementing  a  production  sys¬ 
tem  as  a  psychopharmocology  advisor,  Brooks  and  Heiser  [Brc>oks79]  found  it 
difficult  to  encode  knowledge  that  involves  time-related  data,  and  they  saw  the  need 
for  more  involved  strategies  for  controlling  the  question  asking.  The  simple 
backward-chaining/goal-directed  control  structure  does  not  provide  an  attention- 
focusing  mechanism  for  acquiring  new  information.  Attempts  are  being  made  to 
incorporate  conceptual  structures  and  organization  of  medical  knowledge  into  the 
knowledge  base  by  introducing  some  frame-like  data  structures  so  as  to  facilitate 
the  implementation  of  more  flexible  problem-solving  strategies  [Aikins79]  [Chan- 
drasekaran79].  This  will  be  further  discussed  when  we  explain  in  the  following 
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chapter  why  we  have  abandoned  the  use  of  production  rules  altogether  for  our  prob¬ 
lem  domain. 


IF:  1)  THE  IDENTITY  OF  THE  ORGANISM  IS  NOT  KNOWN  WITH  CERTAINTY,  AND 

2)  THE  STAIN  OF  THE  ORGANISM  IS  GRAMNEG,  AND 

3)  THE  MORPHOLOGY  OF  THE  ORGANISM  IS  ROD,  AND 

4)  THE  AEROBICITY  OF  THE  ORGANISM  IS  AEROBIC 

THEN:  THERE  IS  STRONGLY  SUGGESTIVE  EVIDENCE  (.8)  THAT  THE  CLASS  OF 
ORGANISM  IS  ENTEROBACTERIACEAE 

IF:  1)  THE  THERAPY  UNDER  CONSIDERATION  IS  GENTAMICIN.  AND 

2)  THE  IDENTITY  OF  THE  ORGANISM  MAY  BE  PSEUDOMONAS,  AND 

3)  THE  PATIENT’S  DEGREE  OF  SICKNESS  (ON  A  SCALE  OF  4)  IS 
GREATER  THAN  3,  AND 

4)  [A-  THE  PATIENT  IS  NOT  ALLERGIC  TO  ANTIBIOTICS,  OR 

B-  CARBENICILLIN  IS  NOT  A  DRUG  TO  WHICH  THE  PATIENT  IS  ALLERGIC] 

THEN:  RECORD  THE  FOLLOWING  AS  A  MORE  APPROPRIATE  THERAPY  THAN  GENTAMICIN: 
THE  COMBINATION  OF  GENTAMICIN  WITH  CARBENINILLIN. 


Figure  1.4.2- 1  Examples  of  MYCIN’s  rules 
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INTERNIST 


INTERNIST  is  a  diagnosis  system,  developed  by  Pople  and  Myers  at  the  Univer¬ 
sity  of  Pittsburgh,  currently  covering  an  estimated  (by  the  authors)  60-80%  of  inter¬ 
nal  medicine  [Pople75b,  75c,  77].  In  the  INTERNIST  knowledge  base  each  possible 
diagnosis  Dj  is  associated  with  a  set  of  manifestations  [Mj\.  The  strength  of  the 
association  is  characterized  by  two  parameters,  assessed  on  a  discrete  scale  of  0  to 
5:  the  likelihood  that  manifestation  Mj,  when  present  in  a  patient,  is  caused  by  Z)*, 
and  the  likelihood  that  a  patient  with  diagnosis  would  exhibit  Mj,  are  termed  the 
evoking  strength  and  frequency  respectively.  All  possible  diagnoses  are  classified 
into  a  disease  hierarchy  and  for  each  general  diagnosis  a  list  of  manifestations  and 
their  corresponding  evoking  strengths  and  frequencies  are  computed.  This  allows 
disease  hypotheses  to  be  generated  at  a  more  general  level  when  no  discriminating 
information  is  yet  available.  An  example  of  a  diagnosis  with  its  associated  informa¬ 
tion  is  shown  in  Figure  1.4. 2-2. 

The  use  of  the  two  likelihoods  assigned  to  each  diagnosis-manifestation  relation¬ 
ship  allows  data  to  be  applied  independently  as  evidence  and  makes  possible  the 
implementation  of  a  flexible  focusing  mechanism.  A  score  is  maintained  for  each 
hypothesis  --  evocative  findings  and  confirmed  consequences  counting  in  its  favour, 
while  unexplained  and  absent  expected  findings  counting  against  it.  In  selecting  the 
next  request  for  data,  INTERNIST  concentrates  on  the  highest  ranking  hypothesis. 
As  a  heuristic,  other  hypotheses  which,  together  with  this  chosen  one,  account  for 
no  more  findings  than  either  alone  are  considered  to  be  competing.  The  system 
then  proceeds  to  choose  among  these  competitors  using  a  question  strategy 
selected  according  to  the  number  and  the  cost  of  the  information  to  be  requested, 
until  one  hypothesis  has  a  substantially  higher  score  than  the  others. 
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INTERNIST  is  reported  to  demonstrate  competent  performance  including  the 
handling  of  multiple  diagnoses.  It  however  pursues  all  abnormal  findings  to  comple¬ 
tion.  even  though  a  clinician  often  ignores  minor  unexplained  abnormalities  when  the 
rest  of  a  patient’s  clinical  status  is  well  understood.  Costs  of  running  the  program 
could  be  significant  because  the  complete  scoring,  partitioning  and  strategy  selec¬ 
tion  processes  are  repeated  after  each  new  fact  is  reported.  With  this  control  struc¬ 
ture  the  system  appears  to  digress  from  one  topic  to  another  as  soon  as  new  infor¬ 
mation  suggests  a  new  possibility.  Work  is  being  done  on  a  second  implementation 
of  the  system  to  employ  a  hypothesis  generator  that  uses  the  concept  of  constric¬ 
tors  to  delineate  the  top-level  structure  of  a  complex  problem  and  that  uses  a 
modified  scoring  algorithm  that  considers  within  each  sub-problem  only  those 
findings  judged  to  be  relevant  in  that  context.  There  are  the  unaddressed  problems 
concerning  human  engineering  issues  centered  on  the  usability  of  the  program's 
interface.  A  related  issue  is  the  assignment  of  the  numerical  parameters  stored  in 
the  knowledge  base.  The  significance  of  each  individual  parameter  is  not  at  all 
apparent  in  the  scoring  scheme.  The  score  is  computed  by  manipulating  parame¬ 
ters  of  different  nature  together:  (i)  summing  the  evoking  strengths  of  all  its  man¬ 
ifestations  which  have  been  observed,  (ii)  adding  "bonus"  points  for  confirmed 
causally  consequent  diagnoses,  (iii)  subtracting  the  sum  of  frequencies  of  those  of 
its  manifestations  which  are  known  to  be  absent,  and  also  (iv)  subtracting  a  weight 
of  importance  for  each  significant  finding  which  is  reported  to  be  present  but  which 
is  not  explained  by  either  the  diagnosis  or  some  other  confirmed  diagnosis. 
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PORTAL- VEIN-OCCLUSION 


Manifestation 

evoking  strength 

frequency 

HEPATIC-VEIN-WEDGE-PRESSURE-NORMAL 

0 

4 

SPLENOMEGALY 

1 

4 

GASTR0-INTEST1NAL-HEM0RRHAGE 

1 

4 

VARICES-ESOPHOGEAL 

2 

4 

PORTAL- VEI N-  OBS  TRU  CTION-BY-  RADIOGRAPHY 

5 

3 

ANEMIA 

1 

3 

APPENDICITIS-HISTORY 

1 

2 

ASCITES 

1 

2 

Figure  1.4. 2-2  Example  of  information  associated  with  a  diagnosis 
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PIP 


The  Present  Illness  Program  (PIP)  has  been  developed  by  the  Clinical  Decision 
Making  Group  at  MIT  for  the  domain  of  renal  diseases  [Pauker76].  The  PIP  database 
associates  each  disease  hypothesis  with  its  prototypical  findings  categorized  accord¬ 
ing  to  the  role  they  play  in  the  generation  and  selection  of  the  hypotheses.  The 
matching  of  an  observed  finding  with  one  of  the  TRIGGERS  of  a  hypothesis  causes  it 
to  be  activated  immediately.  As  reflected  by  their  names,  IS-SUFFICIENT,  MUST- 
HAVE  and  MUST-NOT-HAVE  findings  (not  often  available)  are  used  to  make  logical 
decision  as  to  whether  the  hypothesis  can  be  accepted  or  rejected.  The  complemen¬ 
tary  hypotheses,  classified  as  CAUSED-BY,  CAUSE-OF,  COMPLICATED-BY, 
COMPLICATION-OF  or  ASSOCIATED-WITH  according  to  the  nature  of  the  association, 
identify  other  disorders  which  may  be  necessary  in  addition  to  the  hypothesis  under 
consideration  to  account  for  the  condition  of  the  patient.  On  the  other  hand,  the 
DIFFERENTIAL-DIAGNOSIS  relationships  are  used  to  control  the  activation  of  compet¬ 
ing  hypotheses.  An  example  of  a  PIP  disease  frame  can  be  found  in  Figure  1.4. 8-3. 
Inside  the  knowledge  base  are  also  demon  procedures  that  are  invoked  to  character¬ 
ize  the  input  findings  in  detail. 

In  PIP’s  scheme  of  hypothesis  activation  and  confirmation,  the  notion  of  state 
transitions  is  introduced.  Initially,  all  hypotheses  are  inactive  until  their  possibility 
is  suggested  by  some  reported  findings.  As  mentioned  above,  if  a  trigger  of  any 
hypothesis  is  observed,  that  hypothesis  is  made  active  and  all  of  its  related  comple¬ 
mentary  hypotheses  become  semi-active.  Competing  hypotheses  related  in  the 
DIFFERENTIAL-DIAGNOSIS  are  also  semi-activated  if  the  appropriate  condition  holds. 
A  hypothesis  is  said  to  be  accepted  when  an  estimate  of  its  likelihood  is  sufficiently 
high  and  the  hypothesis  is  inactivated  if  it  drops  below  some  other  threshold.  The 
computation  of  the  likelihood  of  a  hypothesis  is  performed  in  two  steps.  Using  the 
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numerical  likelihood  estimator,  a  local  score  is  computed  as  the  sum  of  the  values  of 
the  clauses  normalized  by  the  maximum  possible  total  score.  A  matching  score  is 
then  computed  by  adding  up  contributions  from  all  possibly  complementary 
hypotheses  and  again  normalizing  by  the  maximum  possible  total. 

We  see  that  a  richness  of  diagnostic  concepts  is  incorporated  in  PIP.  Rubin  had 
furthur  introduced  IS-A/CH01CE-0F  relations  and  REJECT  findings  in  her  enhance¬ 
ment  of  the  system  [Rubin75].  The  program's  behaviour  reasonably  resembles  that 
of  the  clinicians  in  many  cases.  However,  its  ability  to  handle  multiple  diagnoses  is 
limited  to  closely  related  diseases.  It  also  suffers  from  the  problems  of  deciding 
when  to  terminate  and  handling  rapid  shifts  of  focus.  Although  the  scoring  function 
provides  a  general  means  of  evaluating  the  significance  of  present  and  absent 
findings,  the  lack  of  "physical  significance"  in  the  numbers  used  makes  it  difficult  to 
be  set  up  and  tuned.  There  is  also  a  potentially  combinatorially  large  number  of 
clauses  in  the  estimator  rendering  it  quite  awkward  to  use. 
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TRIGGERS  (EDEMA  with  LOCATION  =  FACIAL  or  PERI-ORBITAL, 

PAINFULNESS  =  not  PAINFUL,  SYMMETRY  =  not 
ASYMMETRICAL.  ERYTHEMA  =  not  ERYTHEMATOUS) 

FINDINGS  (COMPLEMENT  with  RANGE  =  LOW),  (MALAISE).  (WEAKNESS). 

(ANOREXIA).  (EDEMA  with  SEVERITY  =  not  MASSIVE). 

(PATIENT  with  AGE  =  CHILD  or  YOUNG,  SEX  =  MALE) 

CAUSED-BY  (STREPTOCOCCAL-INFECTION  in  RECENT-PAST) 

CAUSE-OF  SODIUM-RETENTION,  ACUTE-HYPERTENSION,  NEPHROTIC-SYNDROME, 
C-LOMERULmS 

COMPUCATED-BY  ACUTE-RENAL-FAILURE 
COMPLICATION- OF  CELLULITIS 

DIFFERENTIAL-DIAGNOSIS 

(CHRONIC-HYPERTENSION  implies  CHRONIC-GLOMERUIJTIS) 

(EDEMA  with  RECURRENCE  =  not  FIRST-TIME 

implies  NEPHROTIC-SYNDROME,  CHRONIC-GLOMERULONEPHRITIS. 
FOCAL-GLOMERULONEPHRITIS) 

(ABDOMINAL-PAIN  implies  HENOCH-SCHOENLEIN-PURPURA) 

(RASH  with  PURPURA  =  PURPURIC  implies  HENOCH-SCHOENLEIN-PURPURA) 

(RASH  with  (either  LOCATION  =  MALAR  or  PHOTOSENSITIVITY  =  PHOTO¬ 
SENSITIVE)  implies  SYSTEMATIC-LUPUS) 

(JOIN-PAIN  implies  HENOCH-SCHOENLEIN-PURPURA,  SYSTEMIC-LUPUS) 

SCORE 

(((PATIENT  with  AGE  =  CHILD  or  YOUNG)  -v  0.8) 

((PATIENT  with  AGE  =  MIDDLE-AGED)  -  -0.5) 

((PATIENT  with  AGE  =  OLD)  ->  -1.0)) 

(((COMPLEMENT  with  RANGE  =  LOW)  -»  1.0) 

((COMPLEMENT  with  RANGE  =  NORMAL  or  MODERATELY-ELEVATED)  -+  -0.7) 
((COMPLEMENT  with  RANGE  =  VERY-HIGH)  -+  -1.0)) 

(((EDEMA  with  LOCATION  =  FACIAL  or  PERI-ORBITAL,  SYMMETRY  =  not 

ASYMMETRICAL,  DAILY-TEMPORAL- PATTERN  -  WORSE-IN-MORNING. 
PAINFULNESS  =  not  PAINFUL,  ERYTHEMA  =  not  ERYTHEMATOUS)  -*  1.0) 
((EDEMA  with  LOCATION  =  FACIAL  or  PERI-ORBITAL,  SYMMETRY  =  not  ASYMMETRICAL, 
PAINFULNESS  =  not  PAINFUL,  ERYTHEMA  =  not  ERYTHEMATOUS)  -  0.5) 
((EDEMA  with  SEVERITY  =  not  MASSIVE)  ->  0. 1) 

((EDEMA  with  SEVERITY  =  MASSIVE)  -»  -1.0)) 

(((PATIENT  with  SEX  =  MALE)  ->  0.3)  ((PATIENT  with  SEX  =  FEMALE)  -»  -0.3)) 
(((ANOREXIA)  -  0.3)  ((ANOREXIA  absent)  -♦  -0.3)) 

(((WEAKNESS)  ->  0.3)  ((WEAKNESS  absent)  -  -0.3)) 


Figure  1. 4.2-3  The  PIP  Hypothesis  Frame  for  Acute  Glomerulonephritis 
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Summary 


We  have  described  three  AI  approaches  to  computer-aided  medical  diagnosis,  in 
which  different  aspects  of  the  problem  are  emphasized  and  exploited.  They  are  the 
data  structure  and  organization  (MYCIN),  diagnostic  information  (PIP),  and  control 
strategies  (INTERNIST).  Yet  another  approach,  based  on  the  causal  modeling  of  the 
disease  glaucoma,  has  been  under  taken  by  the  group  at  Rutgers  University 
[Weiss77,  78],  In  this  specialized  application,  tests  are  available  to  map  relation¬ 
ships  between  observations,  pathophysiological  states,  and  disease  categories,  and 
they  are  organized  into  a  Causal  Association  NETwork  (termed  CASNET).  The  pro¬ 
gram  works  well  because  its  domain  is  narrow  and  the  underlying  mechanism  is 
well-understood,  which  is  unfortunately  generally  not  true.  Despite  the  initial 
successes  of  the  current  programs,  we  must  note  that  many  important  and  essential 
aspects  of  medical  diagnosis  have  not  be  explored.  Some  of  these  will  be  addressed 
in  the  last  chapter. 

1.5.  Brief  overview  of  our  approach 

In  our  design  we  attempt  to  bring  together  the  positive  points  of  several  exist¬ 
ing  systems  while  trying  to  avoid  or  to  devise  solutions  to  problems  they  have 
encountered.  A  rich  collection  of  medical  information  used  by  these  systems  are 
incorporated  in  our  knowledge  base,  centered  around  disease  categories  and,  more 
generally,  clinical  status  of  a  patient.  Knowledge  is  represented  as  frames,  organ¬ 
ized  using  concepts  from  semantic  network  theory  [Brachman79],  particularly  the 
PSN  formalism  [Levesque?9].  The  basic  entities  of  this  representation  are  classes 
and  (binary)  relations,  and  their  semantics  are  provided  by  a  small  set  of  programs. 
Two  organizational  principles  are  offered  by  the  representation:  the  IS-A  and  PART- 
OF  hierachies,  which  allow  the  specification  of  generalization  and  whole-part 
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relationships,  and  thus  varying  levels  of  abstraction  of  details.  The  concept  of  a 
metaclass,  that  is  a  class  of  classes,  can  be  used  to  explain  certain  features  of  the 
representation  within  itself,  thus  providing  a  means  of  representing  knowledge  about 
representation.  This  uniformity  of  representation  is  exploited  in  the  design  of  a 
module  for  knowledge  acquisition. 

The  disease  frames  are  interconnected  via  several  sets  of  relations  to  form  an 
associative  network.  The  organization  of  data  is  largely  impartial  in  the  sense  that 
it  imposes  little  constraint  on  the  design  of  the  control  structure,  and  it  allows  easy 
retrieval  of  information  and  examination  of  the  knowledge  base.  Another  important 
feature  of  the  representation  is  the  use  of  attributes  to  characterize  specific 
instances  of  a  disease  class.  Not  only  does  it  alleviate  the  difficulty  with  prolifera¬ 
tion  of  frames,  it  also  provides  guidance  for  user’s  input.  The  notion  of  a  clinical 
condition  is  introduced  and  is  used  extensively  as  a  tool  for  aggregating  closely 
related  findings  together  and  "dynamically"  adjusting  the  knowledge  base  to  the  par- 
ticalur  case  under  consideration. 

Relationships  among  the  frames  are  associated  with  measures  of  the  strength  of 
association,  which  are  defined  in  terms  of  MYCIN’s  certainty  factors.  These  parame¬ 
ters  form  the  basis  of  a  scheme  for  maintaining  likelihood  estimates  of  the  disease 
hypotheses,  evaluated  on  both  how  well  they  match  the  reported  findings  locally  and 
how  well  they  fit  in  a  global  disease  model.  This  evaluation  scheme  allows  us  to 
implement  a  very  flexible  control  structure  based  on  the  hypothesize-and-test  para¬ 
digm.  Various  case-building  tactics,  cost-benefit  considerations  and  measures  to 
generate  more  coherent  reasoning  are  provided. 

We  begin  in  the  next  chapter  by  describing  the  most  essential  component  of  the 
system,  namely  the  diagnostic  information.  We  shall  try  to  motivate  what  we  have 
included  in  the  knowledge  base  and  the  way  it  is  organized.  The  representation  is  to 
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a  large  extent  declarative  (cf.  procedural  [Winograd75]).  In  chapter  3  we  define  the 
actual  semantics  of  each  constituent  by  showing  how  it  is  applied  to  the  decision 
making  processes,  with  a  control  structure  which  in  many  respects  resembles  the 
reasoning  styles  observed  in  the  clinician.  The  model  of  clinical  reasoning  on  which 
this  is  based  will  also  be  discussed.  Chapter  4  describes  some  of  our  efforts  to 
make  the  system  a  flexible  and  useful  tool  that  is  pleasant  to  use.  Finally,  we 
present  our  conclusions  and  discuss  some  directions  for  further  research. 
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Chapter  2 


Knowledge  Representation 


2.1.  Introduction 

From  the  brief  discussion  of  several  attempts  at  automating  the  process  of 
medical  diagnosis,  we  see  a  great  variety  in  the  information  used  by  the  systems 
and  schemes  for  representing  and  organizing  it.  Most  of  the  systems  which  have 
taken  an  AI  approach  are  designed  to  mimic,  as  closely  as  possible,  the  reasoning  of 
an  expert  physician  in  making  a  diagnosis.  This  is  so  that  users  from  the  medical 
community  wil  be  able  to  understand  the  decision  making  process  of  the  program 
and  be  willing  to  accept  its  advice. 

In  designing  such  a  system  we  may  therefore  want  to  first  ask  a  physician  the 
question,  "How  do  you  make  a  medical  diagnosis?"  His  explanation  of  the  process 
might  go  as  follows.  "I  first  obtain  the  case  facts  from  the  patient’s  history,  physi¬ 
cal  examination  and  preliminary  laboratory  tests.  Then,  I  evaluate  the  relative 
importance  of  the  different  signs  and  symptoms,  some  of  which  are  more  important 
than  others.  In  order  to  make  a  differential  diagnosis  I  list  all  the  diseases  which 
the  specific  case  can  reasonably  resemble.  I  then  exclude  one  disease  after  another 
from  the  list,  gathering  more  data  when  necessary,  until  it  becomes  apparent  that 
the  case  can  be  fit  into  a  definite  disease  category  or  one  of  several  possible 
diseases,  or  else  that  its  exact  nature  cannot  be  determined.”  This,  obviously,  is  a 
greatly  simplified  explanation  of  the  process  of  diagnosis,  for  the  physician  might 
also  comment  that  after  seeing  a  patient  he  often  has  a  "gut  feeling  about  the 
case",  which  may  come  from  experience  or  simply  "intuition”.  The  "feeling" 
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although  hard  to  explain  may  be  the  summation  of  his  impressions  concerning  the 
way  the  data  seem  to  fit  together,  the  patient’s  reliability,  general  appearance, 
facial  expression,  and  so  forth.  The  physician  might  also  add  that  such  thoughts  do 
influence  the  considered  diagnoses.  There  is  no  doubt  that  complex  reasoning 
processes  are  involved  in  making  a  medical  diagnosis  [Ledley59]. 

However,  in  the  design  of  our  system,  no  attempt  was  made  to  analyze  the 
human  cognitive  process  in  every  detail  and  to  duplicate  the  steps  a  physician 
makes  in  arriving  at  a  diagnosis,  because  such  an  analysis  and  duplication  would  be 
impractical,  if  at  all  possible.  There  is  no  generally  accepted  "scientific''  approach 
to  medical  diagnosis,  often  referred  to  as  an  art  developed  mainly  from  experience. 
Rather,  only  some  "natural"  and  thus  easily  understandable  inherent  concepts  are 
embodied  in  the  scheme  used,  in  an  attempt  to  provide  the  physician  with  some 
means  of  expressing  his  thought  processes  and,  in  return,  understanding  the  reason¬ 
ing  processes  of  the  system. 

In  a  pioneering  paper  [Ledley59]  Lediey  and  Lusted  discussed  the  reasoning 
foundations  of  medical  diagnosis.  From  careful  analysis  of  many  reported  clin¬ 
ic  op  atholog  iced  exercises,  they  suggest  three  basic  concepts  that  are  inherent  in  any 
medical  procedure,  even  when  the  diagnostician  utilizes  them  subconsciously  or  on 
an  "intuitive"  level.  They  include 

(i)  Logic:  This  refers  to  medical  knowledge  that  presents  certain  information  about 

relationships  that  exist  between  the  symptoms  and  the  diseases,  express- 
able  as  Boolean  functions. 

(ii)  Probability:  This  is  essential  since  clinical  decision  making  is  not  an  exact  and 

deterministic  process.  It  involves  various  sorts  of  uncertainties  both  in  the 
data  and  symptom-disease  relations.  Although  medical  diagnostic  text- 
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books  rarely  give  numerical  values,  words  such  as  "frequently",  "very 
often",  and  "almost  always"  are  often  found. 

(iii)  Value  theory:  This  is  involved  in  value  decisions  made  in  complicated  conflict 
situations,  for  example,  in  choosing  a  test  procedure  or  a  therapy  plan. 
Before  a  test  is  carried  out,  its  diagnostic  value,  the  danger  that  the 
patient  plight  be  subject  to.  turn-ar.ound  time,  and  perhaps  .econqmics  are 
considered. 

As  will  be  seen  later,  these  concepts  do  appear  in  our  system  either  explicitly  in  the 
knowledge  representation  or  implicitly  in  the  control  algorithms.  They  provide  the 
physician  a  means  of  explaining  analytically  the  reasoning  steps  that  he  uses  when 
providing  clinical  consultations.  There  are  several  more  recent  studies  attempting  a 
careful  scrutiny  of  the  elements  of  problem  solving  and  clinical  decision  making 
[Newell72]  [Elstein76,  78]  [Kassirer78]  [Barrow78].  Results  from  these  works  will  be 
referenced  from  time  to  time  in  the  following  discussion. 


2.2.  Representational  concepts 

Having  discussed  what  to  represented,  the  next  question  is  how  it  should  be 
represented.  This  has  been  an  important  field  of  AI  research  and  various  methodolo¬ 
gies  have  been  developed  for  representing  knowledge  for  different  domains. 
[Barr8Q]  and  [MylopoulosQO]  provide  fairly  exhaustive  review  of  this  area. 

Before  deciding  on  a  formalism  for  storing  knowledge  in  a  data  base,  we  should 
identify  all  the  different  types  of  information  that  are  required,  examine  how 
different  pieces  of  data  are  related  to  each  other,  and  how  and  when  each  will  be 
utilized.  Only  then  can  we  choose  a  scheme  for  organizing  and  structuring  the  data 
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so  that  easy  retrieval  of  information  is  possible.  In  the  case  of  a  large  knowledge 
base,  storage  efficiency,  maintainability,  and  data  consistency  must  also  be  taken 
into  consideration.  Although  the  production  formalism  is  attractively  simple,  it  is 
often  awkward  to  encode  knowledge  in  its  restrictive  format.  In  this  section  we  shall 
point  out  its  inadequacies  for  our  application  domain  and  motivate  the  use  of  a 
frame  structure  [Minsky75]. 

The  diagnosis  decision  will  mainly  be  made  on  the  basis  of  information  of  the 
form: 

there  is  X%  belief  that  the  presence  (absence)  of  the  set  S  of  one  or  more  symp¬ 
toms  suggests  that  the  patient  has  (has  not)  disease  Dm. 
and  to  a  lesser  extent  on  how  well  different  diseases  fit  in  with  each  other  (also  for¬ 
mulated  as  statements  similar  to  the  one  above).  It  might  then  seem  natural  to 
adopt  production  rules  as  a  representation  of  the  knowledge  base.  Each  symptom 
will  be  treated  as  a  premise  and  the  disease  category  as  the  conclusion,  while  the 
parameter  X  can  be  readily  incorporated  in  a  certainty  factor  associated  with  the 
rule.  However,  the  rule  as  it  stands  above  indicates  that  all  the  symptoms  in  51  are 
necessary  conditions  of  disease  Dm,  which  is  often  not  the  case.  What  is  needed  is  a 
set  of  rules  with  different  combinations  of  the  symptoms  as  their  premises,  but  this 
leads  to  a  combinatorial  explosion  of  data.  However,  studies  show  that  h3'potheses 
are  generated  from  a  very  few  cues  at  most  [Elstein78]  [Kassirer78].  It  is  felt  that 
in  many  cases  the  symptoms  are  to  a  large  extent  independent  and  each  contri¬ 
butes  a  different  degree  of  certainty  to  the  confirmation  of  the  presence  of  the 
disease.  A  consequence  is  that  the  composite  rule  can  be  broken  up  into  separate 
rules  each  with  a  single  premise  and  put  to  use  independently.  Means  must  however 
be  provided  to  allow  grouping  of  strongly  dependent  findings.  Of  course,  this  decom¬ 
position  technique  can  be  accomplished  by  using  production  rules  as  mentioned 
above.  This  will,  however,  result  in  a  total  disintegration  of  the  structure  of  the 
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knowledge  base,  not  to  mention  the  wasteful  repetition  of  some  conclusions  in  many 
rules. 

In  a  research  project  involving  a  MYCIN-like  production  system  PUFF  [Kunz78], 
written  to  perform  pulmonary  function  test  interpretations,  problems  such  as  a  need 
to  focus  the  search  for  new  information  and  the  desire  to  represent  characteristic 
patterns  of  disease  were  encountered.  This  motivated  the  development  of  a 
prototype-directed  system  called  CENTAUR  [Aikins79].  It  uses  a  knowledge  represen¬ 
tation  that  combines  both  production  rules  and  frame-like  structures  called  proto¬ 
types.  Each  prototype  characterizes  some  pulmonary  disease  and  represents  stereo¬ 
typical  situations  which  can  be  used  as  a  basis  for  comparison  to  the  actual  situa¬ 
tion  given  by  the  data.  The  prototypes  guide  the  invocation  of  the  production  rules 
and  focus  the  search  for  new  information.  They  are  also  linked  together  in  a  net¬ 
work  in  which  the  links  specify  the  relationships  between  them.  This  scheme  with 
its  added  flexibility  proved  to  be  fairly  successful,  at  the  expense  of  much  of  the 
simplicity  of  the  production-rule  version. 

In  our  work,  the  rich  collection  of  knowledge  is  also  organized  into  frames, 
packages  of  closely  related  facts.  Gathered  in  each  frame  is  all  the  information 
required  for  each  basic  finding  including  signs,  symptoms  and  laboratory  data  (e.g. 
fever,  erythema-marginatum),  intermediate  clinical  or  physiological  state  (e.g. 
rheumatic- carditis),  and  an  individual  disease  (e.g.  acute-rheumatic-fever)  or  a  class 
of  diseases  (e.g.  streptococcal-infection),  hereafter  collectively  referred  to  as  a 
disease  entity.  Examples  of  these  data  units  are  given  in  Figures  2.3-1  through  2.3- 
5.  They  are  neither  complete  nor  definite,  but  are  merely  intended  for  illustrating 
our  representation  of  diagnostic  and  therapeutic  information.  Each  frame  may  have 
an  associated  description  of  its  internal  structure  that  defines  the  semantic  com¬ 
ponents  of  the  concept  that  the  frame  represents.  The  representation  is  based  on 
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semantic  networks  and  specifically  on  the  PSN  formalism  [Levesque76].  The  internal 
structure  of  a  frame  is  defined  by  slots,  each  of  which  has  a  name  and  an  associated 
piece  of  information  describing  how  the  slot  should  be  applied.  Slots  arc  classified 
into  several  categories  depending  on  the  functions  they  serve.  Manifestation  slots 
define  the  accountable  findings  together  with  their  expected  characteristics.  They 
provide  information  on  what  is  expected  from  the  disease  models  hypothesized  by 
the  system,  and  on  how  they  should  be  adjusted  on  the  failure  of  an  expectation. 
Evidence-of  slots,  on  the  other  hand,  give  information  about  the  possible  causes  of  a 
finding  or  collection  of  findings.  They  are  therefore  links  between  disease  entities  in 
the  opposite  direction  of  the  expectation  links  just  mentioned,  and  provide  informa¬ 
tion  for  the  generation  of  disease  hypotheses.  A  final  set  of  entity-relationship  links 
are  represented  as  association  slots.  Generally,  they  indicate  the  compatibilities 
between  disease  entities,  that  is,  how  they  can  be  fit  together  into  a  "disease  com¬ 
plex",  possibly  with  temporal  information  on  these  relations.  Also  included  in  a 
frame  are  attribute  slots  for  the  characterization  of  the  entity,  and  treatment  slots 
for  the  selection  of  potentially  useful  therapeutic  procedures.  Other  information, 
such  as  the  importance  of  the  sign/disease  and  data  for  bookkeeping  purposes,  are 
conveniently  attached  to  the  frames.  Finally,  general  disease  categories  and  their 
specializations  are  related  via  is-a  links.  The  is-a  relationship  is  a  partial  order  rela¬ 
tion  and  defines  a  hierarchy  along  which  frames  can  be  organized.  It  also  implies 
inheritance  of  properties  from  the  parent  to  the  child  nodes,  and  thus  allows 
hypotheses  to  be  generated  at  a  general  level  and  refined  subsequently. 

Rules  in  explicit  form  are  done  away  with  and  are  embedded  in  the  different 
types  of  links.  Associated  with  these  links  are  the  clinical  conditions  under  which 
they  can  be  applied,  thus  providing  a  means  for  dynamically  adjusting  the  set  of 
"rules"  in  our  knowledge  base  to  the  particular  case  under  consideration.  Since  we 
have  "rules”  working  in  both  directions  between  conceptual  entities,  both  the 
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hypothesis-directed  and  data-directed  approaches  to  problem  solving  (c/.  backward- 
and  forward-chaining  in  production  systems)  can  be  taken  in  our  control  structure 
for  the  diagnosis  process.  Although  the  knowledge  domain  of  interest  here  is  res¬ 
tricted  to  only  cardiovascular  diseases,  we  seek  a  representation  that  is  flexible 
enough  to  be  applicable  to  a  wider  field.  Information  used  by  different  existing  simi¬ 
lar  systems  is  examined  and  is  incorporated  into  our  scheme.  While  trying  to  design 
a  scheme  as  flexible  as  possible,  the  need  for  uniformity  and  simplicity  is  kept 
firmly  in  mind.  In  the  balance  of  this  chapter  we  shall  describe  the  functions  of 
each  constituent  of  a  frame  in  detail  and  show  how  detailed  medical  information  can 
be  represented  in  our  formalism.  We  shall  see  that  although  a  disease  entity  can 
mean  several  things,  they  are  different  levels  of  abstraction  of  the  same  concept, 
and  that  they  play  similar  roles  in  our  model  of  clinical  decision  making  that  will  be 
discussed  in  the  next  chapter.  The  uniform  data  and  control  structures  make  the 
implementation  of  simple  subsystems  for  data  acquisition  and  question-answering 
possible.  They  will  be  described  in  detail  in  chapter  4. 
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frame  acute-rheumatic-fever  ,  AJRF 
instance-of  disease-entity  where  ( 

author  <- 

"Steve  Ho-Tai"  , 


date  «- 

"20-10-80"  , 
source  «- 

"[1]  126-129,  [2]  497-506,  [6]  1237-1243"  , 


type  «- 

u Itimate  , 
description  «- 

"fever  occurring  during  recovery  from  infection,  usually  of  the  throat,  with 
group  A  streptococci;  it  occurs  in  children  and  young  adults,  and  is  variably 
associated  with  acute  migratory  polyarthritis,  Sydenham’s  chorea,  subcutane¬ 
ous  nodules  over  bony  prominences,  myocarditis  with  formation  of  Aschoff 
bodies,  which  may  cause  acute  cardiac  failure,  and  endocarditis  which  is  fre¬ 
quently  followed  by  scarring  of  valves,  causing  stenosis  or  incompetence."  , 


important  «- 

Syndenham’s-chorea  ) 


with 

manifestations 

mOl  :  patient  such  that  (patient. age  ^  4  and  patient. age  3*  1C) 
usually  present  ; 

m02  :  arthritis  such  that  (arthritis. multiple-joint  =  yes  and 
arthritis. migratory  =  yes)  often  present  ; 
m03  :  rheumatic-carditis  sometimes  present  ; 
m04  :  subcutaneous-nodules  sometimes  present  ; 
m05  :  erythema-marginatum  sometimes  present  ; 
m06  :  arthralgia  sometimes  present  ; 
m07  r  Sydenham’s  chorea  rarely  present  ; 

associations 

aOl  :  ARF-history  often  precedes  ; 

a02  :  group-A-streptococcal-infeetion  always  precedes 
when  (no  Sydenham’ s-chorea)  ; 

treatment 

tOl  :  general  where  (instruction  <-  "bed  rest  until  the  patient 
is  afebrile,  gaining  weight  and  the  ESR  is  normal")  ; 
t02  :  prenisolone  where  (dosage  <-  "10-20  mg  daily") 
when  rheumatic-carditis  ; 

t03  :  aspirin  where  (dosage  «-  "80  mg/kg  body  weight  daily”  , 

goal  «-  "maintain  serum  salicylate  level  at  30mg/100ml") 
when  (no  rheumatic-carditis)  ; 

end 


Figure  2.2-1  Frame  structure  of  an  ultimate  etiology 
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frame  group- A-streptococcal-infection  is-a  streptococcal  infection 


instance-of  disease-entity  where  ( 

author  «- 

"Steve  Ho-Tai"  , 


date  «- 

”11-10-80"  , 


type  «- 

ultimate  , 


description  <- 

"Infection  caused  by  an  organism  belonging  to  the  genus  Streptococcus.  Beta 
hemolytic  reaction  on  blood  agar  ..."  ) 


with 


associations 

aOl  :  pharyngitis-tonsillitis  often  follows  ; 
a02  :  pyoderma  sometimes  follows  ; 
a03  :  acute-rheumatic-fever  rarely  follows  ; 
a04  :  glomerulonephritis  rarely  follows  ; 

end 


Figure  2.2-2  Frame  structure  of  a  specific  disease 
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frame  streptococcal-infection  is-a  infectious-disease 


instance-of  disease-entity  where  ( 

author  «- 

"Steve  Ho-Tai”  , 


date  *- 

"20-10-80”  , 


source  *- 

"[7]  172-185”  , 


type  *- 

ultimate  , 


description  <- 

"Infection  caused  by  any  organism  of  the  genus  Streptococcus,  containing 
Gram-positive,  spherical  or  ovoid  cells  which  occur  in  pairs  or  chains.  Nor, mo¬ 
tile.  Aerobic,  facultatively  anaerobic."  ) 


with 


manifestations 

mOl  :  elevated-ASO  always  present 
when  (no  penicillin)  ; 
m02  :  elevated-ESR  always  present 

when  (no  corticosteroid  and  no  salicylate)  ; 
m03  :  fever  usually  present  ; 
m04  :  sore- throat  usually  present  ; 

attributes 

blood-agar-hemolysis  :  value  ; 


evidence- of 

eOl  :  group-A-streptococcal-infection  strongly  suggested 

when  (streptococcal-infection.blood-agar-hemolysis  =  beta )  ; 


treatment 

tOl  ;  procaine-penicillin-G  where  (dosage  <  "60000C  units  twice  daily"  , 
duration  <-  "6  days")  ; 


end 


Figure  2.2-3  Frame  structure  of  a  disease  class 
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frame  rheumatic-carditis 


instance-of  disease- entity  where  ( 

author  «- 

"Steve  Ho-Tai"  , 


date  <- 

"20-10-80''  , 


type  «- 

intermediate  , 

description  t 

"This  indicates  cardiac  involvement  of  acute  rheumatic  fever.  First  manifested 
by  the  appearance  of  organic  heart  murmurs,  most  commonly  those  of  mitral 
or  aortic  regurgitation.  Signs  and  symptoms  of  pericarditis  and  of  congestive 
heart  failure  may  supervene  in  more  severe  cases.  Prolongation  of  F-R  interval 
in  ECG  is  common."  , 


condition  <- 

(endocarditis  or  myocarditis  or  pericarditis)  ) 


with 


evidence-of 

eOl  :  acute-rheumatic-fever  strongly  suggested  ; 


end 


Figure  2.2-4  Frame  structure  of  a  clinical  state 
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frame  erythema- marginatum  is-a  erythema 


instance-of  disease- entity  where  ( 

author  «- 

"Steve  Ho-Tai"  , 


date  «- 

"20-10-80"  . 


source  <- 


n 


[1]  127"  . 


type  «- 

initial  , 


description  «- 

"Evanescent  pink  rash  on  trunk  and  proximal  part  of  the  extremities,  never  on 
the  face.  The  erythematous  areas  have  pale  centres  and  serpiginous  margins 
without  induration."  , 


important  <- 

always  ) 


with 


attributes 

location  :  choice  where  (datum  <-  erythema.location)  ; 
symmetrical  :  yes-no  where  (datum  <-  erythema. symmetrical)  ; 


evidence- of 

eOl  :  acute-rheumatic-fever  strongly  suggested  ; 


end 


Figure  2.2-5  Frame  structure  of  a  basic  finding 
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2.3.  Frame  contents 


The  medical  information  required  for  the  diagnostic  procedures  is  grouped  into 
frames  --  knowledge  units  centered  around  disease  entities.  An  extended  BNF 
description  of  the  full  syntax  of  the  frame  structure  can  be  found  in  Appendix  I. 
This  serves  as  a  way  of  laying  out  the  information  in  a  concise  and  comprehensive 
form.  Note  that  in  the  representation,  we  have  distinguished  the  information  associ¬ 
ated  with  a  disease  entity  (e.g.  general  description)  from  that  associated  with  a 
specific  occurrance  of  the  entity  (e.g.  observed  manifestations).  Their  meanings 
and  functions  will  be  individually  described,  furnished  with  examples  to  motivate  the 
need  for  each  constituent.  Details  about  the  representational  structures  will  be  dis¬ 
cussed  in  the  section  on  data  acquisition  in  chapter  4. 

author,  date 

Author  and  date  of  creation  is  automatically  tagged  to  the  frame  structure  by 
the  system  upon  its  creation. 

source 

This  is  the  reference  or  person  from  which  the  information  is  acquired.  This  is 
the  place  the  user  should  refer  to  when  he  finds  the  frame  contents  questionable. 

This  bookkeeping  information  provides  a  useful  form  of  documentation  in  keeping 
track  of  a  large  and  growing  knowledge  base. 
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type 


This  is  the  nature  of  the  disease  entity,  which  can  be  labelled  one  of  the  follow¬ 
ing: 


(i)  ultimate  —  for  those  that  can  be  considered  ultimate  etiologies  and  whose  causes 

need  not  be  searched  for,  or  are  not  known,  before  a  treatment  can  be 
selected.  An  example  is  left-ventricular-failure  ( LVF ). 

(ii)  intermediate  —  for  those  intermediate  pathophysiological  states  and  general 

disease  categories  which  may  be  suggested  by  their  corresponding  sets  of 
manifestations,  but  their  causes  or  specializations  must  be  known  before 
they  can  be  properly  treated.  Pulmonary -edema,  which  is  a  manifestation 
of  LVF ,  is  labelled  as  such.  It  can  also  be  neurogenic  in  which  case 
therapy  of  a  different  nature  should  be  considered. 

(iii)  others  —  for  non-cardiovascular  disease  entities.  There  is  no  information  on 

these  diseases  in  the  knowledge  base,  and  so,  confirmation  and  treatment 
recommendation  cannot  be  performed  for  them. 

(iv)  types  of  basic  findings  -- 

One  of  these  is  initial  for  initial  data,  such  as  those  from  patient’s  history 
and  physical  examination,  which  are  expected  to  be  input  in  response  to 
the  system’s  initial  request  for  data.  Another  is  specific  for  those  findings 
that  cannot  be  expected  to  be  input  by  the  user  on  his  own  initiation. 
Examples  are  slurred-speech  and  tongue-biting  --  findings  that  may  not  be 
noticed  unless  specifically  looked  for.  Other  types  designate  the  lab  pro¬ 
cedures  corresponding  to  the  findings.  Examples  of  this  type  include  ECO 
for  e.g.  prolong ed-PR-interval  and  atrioventricular-block\  and  auscultation 
for  e.g.  pulmonnry-ej ection-sound  and  paradoxical-SZ-split. 
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The  distinction  between  these  different  types  is,  unfortunately,  not  very  clear- 
cut.  For  instance,  anemia  with  its  abundance  of  fairly  strong  supporting  symptoms 
may  be  considered  an  intermediate  state.  It  is  however  given  the  type  blood-test 
since  a  blood  count  can  readily  provide  a  solid  confirmation.  Intermediate  is  in  fact 
the  default  label  given  to  those  entities  which  do  not  belong  to  any  particular  data 
group  and  cannot  be  classified  as  ultimate.  An  understanding  of  how  the  informa¬ 
tion  is  put  to  use  will  help  to  clarify  the  case. 

For  lab  findings,  type  is  used  when  requesting  data  from  the  user.  Instead  of 
asking  for  a  specific  sign  or  test  result,  its  type  is  printed  to  suggest  the  general 
test  procedure  to  be  done.  In  the  case  where  the  test  interpretation  is  to  be  car¬ 
ried  out  by  a  subsystem,  it  can  be  set  into  action  through  a  procedure  associated 
with  its  type.  Currently  there  are  only  two  simple  subsystems,  one  for  checking  if 
the  patient  has  a  previous  history  of  a  specified  disease,  and  the  other  for  checking 
if  a  specified  medication  has  been  taken  recently.  More  will  be  said  about  this  sys¬ 
tem  interface.  Also  attached  to  these  type s  are  indices  which  roughly  separate 
types  into  different  cost  levels  according  to  the  expense  required  to  run  the  test  or 
subsystem,  and  according  to  the  danger  that  the  patient  will  be  subject  to  as  well. 
At  the  bottom  we  find  the  initial  type  and  ascultation.  Blood-test  will  be  found 
higher  up,  and  invasive  techniques  such  as  catheterization  higher  still.  They,  to 
some  extent,  determine  the  order  in  which  the  tests  will  be  performed.  Generally, 
tests  with  a  lower  cost  will  be  carried  out  first. 

description 

This  is  a  concise  discription  of  the  characteristics  of  the  disease  entity.  It  may 
usefully  serve  as  a  definition  when  the  user  is  in  doubt. 
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attributes 


Attributes  are  properties  associated  with  the  disease  entities,  such  as  the  ori¬ 
gin,  radiation,  quality,  duration  of  chest-pain,  which  provide  diagnostic  information 
for  the  possible  causes  of  the  entities.  Mot  only  does  the  use  of  attributes  avoid 
proliferation  of  frames  by  not  having  one  for  each  entity  with  a  certain  set  of 
characteristics  (c/.  INTERNIST),  but  also  it  provides  guidance  for  the  user  to  charac¬ 
terize  his  input  findings  and  helps  to  alleviate  the  difficulty  of  providing  the  informa¬ 
tion  in  conformity  with  the  system’s  knowledge.  Each  of  the  attributes  is  given  an 
attribute  type  which  provides  a  general  way  of  expressing  instructions  to  the  system 
on  how  to  find  the  required  information.  This  small  set  of  "input  drivers"  together 
play  the  role  of  the  routines  in  PIP  for  the  characterization  of  the  mput  findings. 
The  approach  we  use  differs  slightly  from  the  one  more  typically  taken,  which  relies 
on  a  demon-like  mechanism  that  uses  the  full  power  of  the  underlying  programming 
language.  We  have  avoided  the  use  of  an  arbitrary  body  of  code  and  emphasized 
instead  the  use  of  a  task  specific  high  level  language.  There  are  currently  three 
such  instructions  that  can  be  used: 

(i)  yesmo  --  to  be  obtained  from  the  user  as  a  yes/no  response,  e.g.  the  symmetri¬ 

cal  slot  of  the  edema  frame. 

(ii)  choice  --  to  be  requested  from  the  user  as  a  choice  among  the  possible 

responses,  e.g.  origin  of  erythema. 

(ill)  value  —  atomic  value  with  a  generally  accepted  terminology  to  be  supplied  by 
the  user,  e.g.  blood-agar-hemolysis  of  streptococcal-infection  (possible 
values  are  alpha ,  beta  and  no)  and  numerical  lab  data. 

These  are  special  data-type  frames,  ail  with  a  slot  named  datum.  It  can  be 
filled  with  an  attribute  value  from  a  related  entity  (to  avoid  duplicate  requests),  in 
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which  case  it  is  represented,  by  a  where  clause,  such  as  in  e "ry ill e nia-marginat um: 

location  :  choice  where  (datum  «-  erythema. location)  ; 

When  the  where  clause  is  omitted  the  datum  will  be  requested  and  the  attribute 
type  indicates  howr  the  user  should  be  prompted  for  the  data.  For  the  above  exam¬ 
ples,  the  system  will  generate  the  following  requests: 

(i)  Is  the  erythema  symmeir'ical ? 

(ii)  Which  one(s)  of  the  following  describes  the  origin  of  the  erythema 

(1)  trunk; 

(choices  are  collected  from  the  conditions  in  which  the  attribute  appears) 

(iii)  What  is  the  blood  agar  hemolysis  of  the  streptococcal  infection ? 

Attributes  such  as  sex  and  a^e  are  attached  to  the  "entity”  -patient,  and  lab  data 
such  as  ESR  (erythrocyte  sedimentation  rate)  and  WBC  (white  blood  cell  count)  in 
blood-test  are  attached  to  the  corresponding  test  type  in  the  same  manner.  They 
also  serve  the  purpose  of  guiding  user’s  input.  It  is  possible  to  relieve  the  physician 
of  the  chore  of  inputting  such  routine  data  by  having  them  entered  with  a  mark- 
sense  input  device  using  special  forms  prepared  by  the  lab  technicians. 

condition 

This  feature  serves  dual  purposes.  Firstly,  it  provides  a  way  of  grouping  a  col¬ 
lection  of  signs  and  symptoms  together  to  describe  a  complex  clinical  state.  This  is 
needed  in  situations  where  each  symptom  alone  is  too  non-specific  to  provide  any 
useful  information  but  together  they  can  bo  of  significant  diagnostic  value;  or  where 
the  symptoms  are  strongly  dependent  on  each  other  so  that  the  association  factor 
cannot  be  easily  divided  among  the  individual  symptoms.  For  example, 
subvalvarfibromiLscular-steTiosis,  has  an  auscultation  condition: 
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systoLic-thrill  and  ejection-murmur  and  no  ejeciion-ctick. 


Generally,  basic  findings  obtained  from  a  single  test  procedure  are  usually  governed 
by  the  same  pathophysiological  mechanism  and  are  therefore  strongly  interrelated. 
Secondly,  relational  operators  such  as  >,  5  and  =  can  also  appear  in  a  condition. 
Together  with  the  logical  operators  no,  or  and  and  (in  order  of  precedence,  which 
can  be  altered  with  parentheses),  they  can  form  complicated  expressions  and  can  be 
used  for  interpreting  test  data.  For  instance,  a  manifestation  of  streptococcal- 
infection  is  the  blood-test  condition  elevaled-ASO  (anti-strep toLy sin-0)  -Litre,  which 
demonstrates  a  high  level  of  streptococcal  antibody  in  the  blood: 


(patient. age-group  =  child  and  blood-test. ASO-titre  >  333) 
or  (patient. age-group  =  adult  and  blood-test. ASO-titre  >  30G) 


Not  only  does  condition  help  to  alleviate  the  difficulty  with  dependence  between 
symptoms  that  plagues  the  Bayesian  approach  (c/.  the  clustering  technique  in  the 
statistical  approach),  it  retains  the  uniformity  of  the  knowledge  base.  These  "com¬ 
plex  situations"  are  treated  in  the  same  way  as  other  types  of  disease  entities. 
Their  frames  are  also  tagged  with  knowledge,  such  as  information  for  triggering 
disease  hypotheses,  and  with  a  cost  factor.  They  therefore  fit  nicely  into  both  the 
data-directed  and  hypothesis-directed  aspects  of  the  control  scheme.  They  are 
evaluated  when  findings  for  the  corresponding  test  procedures  have  been  entered. 

It  must  be  noted  that  not  all  terms  in  the  expression  can  be  treated  as  the  reg¬ 
ular  type  of  logical  variables,  since  the  presence  of  a  disease  entdjr  can  cither  be 
true,  false,  or  undetermined.  In  fact,  in  order  to  be  able  to  assign  a  continuous  cer¬ 
tainty  measure  to  a  condition,  a  scheme  which  is  a  step  beyond  multi- valued  logic 
has  to  be  adopted. 
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manifestations 


This  is  the  collection  of  the  disease  entities,  including  basic  findings  and  patho¬ 
physiological  states,  whose  presence  will  be  used  as  the  sole  evidence  for  the 
confirmation  of  the  presence  of  the  disease  entity,  Di,  itself.  The  term  manifesta¬ 
tions  may  be  misleading  here,  since  it  is  extended  to  include  various  types  of  evi¬ 
dence,  such  as  the  patient’s  age  and  occupation  (for  occupational  hazards!).  The 
slot  for  each  manifestation  consists  of  four  possible  components: 

<slot  name>  :  <disease  entity>  [such  that  <condition>] 

< quantified  present  [when  <condition>]  ; 

< disease  entity>  is  the  name  of  the  manifestation,  Mj.  It  is  associated  with  a 
parameter  which  will  be  referred  to  as  the  necessity  factor,  NF[DitMj].  This  factor 
can  take  on  a  fractional  value  in  the  interval  from  0  to  1  inclusive,  and  it  represents 
a  rough  measure  of  how  necessary  the  presence  of  this  particular  manifestation  is 
for  confirming  that  the  patient  has  the  disease  or  is  in  the  clinical  state  under  con¬ 
sideration.  NF  is  loosely  related  to  the  frequency  with  which  patients  with 
confirmed  diagnosis  Di  will  display  Mj  as  a  manifestation  of  that  disease  (F(Mj  \Dj), 
as  used  in  INTERNIST).  Intuitively,  the  more  frequently  a  manifestation  appears  with 
its  associated  disease,  the  more  necessary  a  condition  it  is,  or  in  other  words,  the 
stronger  its  absence  is  as  evidence  against  the  disease  hypothesis.  It  is  given  the 
value  1  when  the  manifestation  is  always  present  along  with  the  disease  and  is  there¬ 
fore  absolutely  necessary  to  be  present.  The  absence  of  such  manifestation  will 
immediately  rule  out  the  possibility  of  having  the  disease.  For  instance,  patients 
with  left-ventricular- failure  almost  always  suffer  from  dyspnea  (note,  however,  that 
the  presence  of  this  symptom  is  rather  insuggestive).  On  the  other  hand,  if  Mj 
rarely  occurs  in  Dt  so  that  the  fact  that  Mj  is  missing  suggests  virtually  nothing 
about  Dit  then  NF[Dj,Mj]  w  0.  Hemoptysis  is  one  such  manifestation  of  LVF.  In  such 
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cases,  the  manifestations  should  still  be  included  in  the  list  anyhow  in  order  to  keep 
a  complete  record  of  all  the  manifestations  accountable  by  the  disease,  no  matter 
how  rare  they  may  be.  A  discrete  scale  which  splits  the  possible  value  of  this 
parameter  into  five  levels,  namely  always ,  usually,  often,  sometimes  and  rarely,  is 
used  as  a  more  natural  convention  for  the  physician  to  quantify  the  "strength"  of 
the  clinical  relationships.  The  discussion  on  the  model  of  reasoning  and  what  is 
exactly  to  be  done  with  the  list  of  manifestations  will  clarify  how  the  NF s  are 
assigned.  Essentially,  this  provides  information  on  reevalutation  of  the  likelihood  of 
a  disease  hypothesis  when  an  expected  finding  is  absent. 

<Condition>  is  the  same  as  that  described  in  the  previous  section  and  they 
share  the  same  form.  The  <condition>  in  the  when  clause  represents  the  clinical 
situation  under  which  the  manifestation  is  expected  to  be  present.  Only  when  the 
condition  is  satisfied,  will  the  manifestation  ever  be  considered.  The  age  and  sex  of 
the  patient  are  commonly  found  in  a  <condition>.  This  can  also  be  used  to  handle 
the  excuse  that  is  sometimes  available  for  the  absence  of  an  expected  finding. 
Streptococcal-infection  provides  good  examples  of  such  <condition>s.  It  often 
causes  the  ASQ-titre  to  rise  several  weeks  after  the  infection,  indicating  that  the 
body  is  fighting  the  infection  with  antibodies.  Taking  penicillin  to  combat  the  infec¬ 
tion,  however,  often  squelches  the  antibody  response.  The  condition  no  pencillin  will 
serve  as  an  excuse  for  elev ate d-ASO -litre.  The  streptococcal-infection  hypothesis  is 
evaluated  as  if  elevated-ASO-titre  were  not  a  relevant  symptom  whenever  penicillin 
has  been  taken. 

In  the  optional  such  that  clause,  the  <condition>  specifies  the  expected  charac¬ 
teristics  of  the  manifestation.  It  must  be  satisfied  before  an  expectation  will  be  con¬ 
sidered  matched.  As  mentioned  before,  the  use  of  attributes  here  facilitates  the 
economy  of  representation.  When  a  condition  is  not  given,  it  is  considered  as  one 
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that  is  always  a  hundred  percent  true. 


evidence-of 

These  are  the  backward  links  to  the  possible  "causes"  of  the  disease  entity,  Di, 
and  are  used  for  activating  disease  hypothesis.  These  triggers,  being  the  source  of 
the  symptom-centered  diagnostic  information,  are  very  important  for  the  decision¬ 
making  process.  The  format  of  each  slot  is  similar  to  that  of  a  manifestation.  The 
parameter  associated  with  each  cause,  Ck,  is  called  the  sufficiency  factor, 

It  indicates  how  sufficient  the  presence  of  Di  is  as  a  condition  for  that  of  C k,  and  is 
entered  into  the  computation  of  the  certainty  factor  or  level  of  confidence  of  Ck.  Its 
value,  which  also  lies  between  0  and  1  inclusive,  represents  the  likelihood  that  if 
manifestation  Di  is  observed  in  a  patient,  diagnosis  Ck  is  its  cause  (relative  to  all 
other  causes  of  D *),  L(Ck\Di).  This  plays  a  role  similar  to  that  of  the  "evoking 
strength"  in  INTERNIST.  A  value  of  0  indicates  that  the  manifestation  is  too 
nonspecific  to  draw  any  diagnostic  conclusions,  and  1  indicates  Di  is  pathognomonic 
for  Ck,  i.e.  the  observation  of  Di  is  sufficient  for  us  to  conclude  that  C*  is  present. 
Vomiting  and  the  ECG  condition  digitalis- toxicity-ECG: 

(paroxymal- atrial-tachycardia  and  atrioventricular-block) 
are,  respectively,  these  two  types  of  manifestations  of  digitalis-toxicity .  The  value  of 
this  parameter  can  also  be  divided  into  five  levels:  definitely ,  strongly ,  moderately, 
weakly  and  insignificantly  suggested,  for  user’s  convenience. 

Again,  a  <eondition>  may  also  be  associated  with  each  cause,  indicating  under 
what  clinical  situation  the  cause  can  be  considered  possible.  Such  a  condition  will 
help  to  describe  the  state  of  the  patient  better  and  provide  stronger  support  for  the 
confirmation  of  the  cause  than  the  manifestation  alone  will.  For  instance,  chest-pain 
without  any  other  information  is  too  general  a  symptom  to  allow  tracking  down  its 
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cause.  It  is  therefore  characterized  by  a  few  attributes  such  as  its  location,  quality 
and  quantity.  These  characteristics  appear  in  the  conditions  of  the  possible  causes 
as  they  provide  much  more  diagnostic  information.  One  of  the  causes  is: 

pericarditis  strongly  suggested  when 

(chest-pain  .  origin  =  "substemal  anterior  chest"  , 
radiation  =  "left  scapula”  , 
quality  =  "sharp”  , 

aggravating/alleviating -factors  =  "deep  breathing,  lying  down"  . 
duration  =  "related  to  each  heartbeat"  )  ; 

In  many  situations,  competing  disease  hypotheses  have  manifestations  that  are 
quite  similar  and  confusing.  In  medical  practice  these  are  identified  and  differential 
diagnosis  procedures  are  designed  to  pin  down  the  correct  one.  Generally,  specific 
symptoms  that  strongly  support  a  single  disease  are  searched  for.  The  necessary 
information  can  be  conveniently  encoded  with  conditions  which  have  to  be  satisfied 
before  the  corresponding  disease  hypotheses  are  activated.  However,  these  condi¬ 
tions  are  simply  manifestations  of  the  possible  causes,  with  a  high  sufficiency  factor 
for  one  particular  cause  but  low  for  the  others.  In  fact,  the  system  is  capable  of 
selecting  such  symptoms  with  high  diagnostic  value  and  these  data  arc  therefore 
somewhat  redundant.  None  the  less,  they  are  useful  for  directly  reducing  the 
number  of  hypotheses  active  at  the  same  time  in  the  situations  mentioned  above. 
For  example,  edema  can  come  from  numerous  possible  etiologies  including  diseases 
of  cardiovascular,  renal,  endocrinological  and  gestro-intestinal  origins.  If  a  uri- 
nanalysis  shows  that  heavy-proteinuria  is  not  present,  then  the  renal  diseases  can 
be  safely  ruled  out  as  the  possible  cause. 
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associations 


These  constitute  the  last  set  of  disease  entities  that  are  used  in  controlling  the 
activation  of  hypotheses.  A  cause-manifestation  relation  between  two  disease- 
entities  implies  the  inheritance  of  findings  from  the  manifestation  to  the  cause;  that 
is,  every  finding  that  may  show  in  the  presence  of  the  manifestation  may  also 
appear  with  the  cause,  and  so  its  presence  is  an  evidence,  indirect  though  it  may 
be,  of  the  cause.  The  disease  entities  therefore  form  a  PART-OF  hierarchy  (with 
manifestations  as  parts),  in  which  the  basic  findings  appear  at  the  bottom  and  the 
ultimate  etiologies  at  the  top.  Here,  associations  refer  to  all  other  relations 
between  elementary  hypotheses  where  the  symptoms  of  the  two  diseases  concerned 
are  not  in  a  subset/superset  relation.  The  relationship  may  be  "causal”  if  the  phy¬ 
siology  of  the  disorders  is  well  understood,  may  be  "complicational"  if  one  disorder 
is  a  typical  complication  of  the  other,  or  may  be  "associational”  if  the  two  may  be 
related  by  some  known  but  incompletely  understood  association.  It  also  includes  a 
less  common  type  of  relation  between  two  diseases  which  are  related  but  incompati¬ 
ble  with  each  other,  that  is,  the  two  tend  not  to  show  up  together.  Rejecting  symp¬ 
toms  that  are  used  in  Rubin’s  system,  for  the  diagnosis  of  renal  diseases,  to  reduce 
the  number  of  active  hypotheses  (e.g.  the  presence  of  red-blood-cell-casls  rules  out 
the  diagnosis  of  sickle-cell-trait)  can  be  coded  with  this  feature.  An  important 
difference  between  an  association  and  a  manifestation  is  that  although  the  former 
plays  a  very  useful  role  in  focusing  the  system's  attention,  it  does  not  enter  into  the 
criterion  for  accepting  the  disease  hypothesis  while  the  latter  does.  The  reason  for 
this  distinction  will  be  discussed  later. 

An  associational  slot  may  have  five  components: 

<slot  name>  :  <disease  entity>  [such  that  <condition>] 

<quantifier>  <association  type>  [when  <condition>] 
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Assigned  to  the  associated  <disease  entity>,  Alt  is  a  factor  in  the  range  -1  to  +1 
inclusive.  It  is  called  the  compatibility  factor,  CF[pi,A{\,  a  term  borrowed  from  the 
method  of  relaxation  labeling  [Rosenfeld76]  —  a  powerful  technique  which  finds  suc¬ 
cessful  applications  in  image  processing  at  different  levels  of  abstraction  [Hanson78] 
[Riseman78]  [Zucker77]  and  motion  understanding  [Tsotsos80].  As  a  matter  of  fact, 
these  factors  are  used  for  the  same  purpose,  namely  to  provide  a  means  of  evaluat¬ 
ing  the  hypotheses  in  the  more  global  context  of  cooperation  and  competition 
among  the  hypotheses.  Similar  to  the  "evoking  strengths",  they  can  be  related  to 
the  likelihood  that  the  association  Ai  is  present  (absent  if  CF  <  0)  when  the  disease 
Di  is  present.  The  compatibility  factors  of  the  complementary  relations  are  there¬ 
fore  positive  and  those  of  the  competing  relations  are  negative.  At  one  extreme  of 
the  spectrum  where  CF\Pi,A{]  —  +1,  D *  cannot  exist  without  Ait  while  at  the  other 
extreme  where  CF  =  -1,  A,  must  not  be  present  in  the  presence  of  Di.  Intermediate 
values  indicate  different  degrees  of  compatibility  between  related  disease  entities. 
Note  that  CF[Di,Ai\  is  not  necessarily  equal  to  CF\Ai,Di ]  since  neither  causal  nor 
associational  relations  are  symmetrical.  The  compatibility  factor  in  the  reverse 
direction,  CF\A\,Di\  is  found  in  the  separate  frame  for  At. 

Again,  this  parameter  can  be  quantified  with  a  discrete  scale  of  five  levels. 
There  are  four  possible  association  type>  that  can  be  used  to  indicate  the  type  of 
the  relation:  related  for  complementary  relations  and  conflicts  for  competing  rela¬ 
tions.  In  the  former  type  of  relations,  follows  and  precedes  can  be  used  to  capture 
the  temporal  or  causal  information  of  the  association,  if  known.  The  distinction  of 
the  direction  of  the  relation  in  such  cases  merely  helps  phrasing  the  final  diagnoses 
to  show  their  causal  relationships.  Note  that  these  associational  relations  are  not  as 
clearly  understood  as  the  manifestation/evidence-of  relations  and  their  associated 
parameters  can  only  be  roughly  estimated.  However,  since  as  mentioned  above,  the 
CF s  are  used  oniy  in  the  heuristic  for  controlling  the  focus  of  attention,  their  exact 
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values  are  not  crucial.  This  is  one  of  the  advantages  of  the  adopted  scheme. 

important 

This  is  a  condition  which  indicates  when  special  attention  should  be  given  to  the 
disease  entity.  For  an  observed  manifestation  it  signifies  whether  the  finding  can  be 
readily  ignored  --  null  (default  value)  if  it  could  be  considered  a  "red  herring"  or 
always  if  it  must  be  explained  by  one  of  the  final  diagnoses.  Cough  and  hemupiys-is 
(expectoration  of  blood)  are  respectively  examples  of  the  two  types  of  findings. 
Also,  in  selecting  a  treatment  procedure,  the  important  condition  of  a  possible  diag¬ 
nosis  will  be  taken  into  consideration.  A  disease  that  has  a  high  risk  factor  and 
responds  well  to  treatment  may  merit  therapeutic  attention  even  when  it  has  a  rela¬ 
tively  low  likelihood.  The  important  feature  therefore  provides  a  means,  in  a  limited 
way,  of  introducing  cost-benefit  consideration  into  the  decision  process. 

treatment 

An  attempt  is  made  here  to  have  the  system  handle  the  selection  of  treament, 
as  diagnosis  is  hard  to  divorce  from  therapy  in  any  practical  sense.  The  required 
information  used  by  the  system  is  also  centered  around  the  diagnosed  diseases,  just 
as  it  is  centered  around  the  identified  organisms  in  MYCIN.  Each  disease  is  associ¬ 
ated  with  a  list  of  potential  therapies,  which  will  be  considered  when  the  disease  has 
a  sufficiently  high  likelihood  of  being  present.  However,  since  the  nature  of  the 
treatment  such  as  surgical  operations  and  medications  can  be  widely  different,  it  is 
very  difficult  to  devise  a  scheme  for  comparing  the  effectiveness  of  the  different 
therapies  (similar  to  the  way  the  MYCIN  selects  drugs).  Each  treatment  is 
represented  by  a  frame  which  attributes  are  assigned  specific  details  for  treating  a 
particular  situation,  as  in  the  ARF  frame.  Since  drug  therapy  can  be  more  easily 
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characterized,  and  compared,  only  medications  are  each  represented  by  a  separate 
frame,  with  attributes  such  as  dosage,  duration  and  goal.  Other  therapeutic  pro¬ 
cedures  are  collectively  categorized  under  the  general  frame.  Anyhow,  all  attribute 
values  are  currently  simply  text  and  do  not  enter  in  the  decision  making  processes. 
Potential  use  of  such  information  will  be  discussed  in  the  last  chapter. 

Each  treatment  can  be  associated  with  a  <condition>  which  must  be  evaluated 
to  be  true  before  the  treatment  will  be  suggested.  This  condition  could  be  contrain¬ 
dication  of  the  treatment.  For  example,  a  surgical  procedure  can  only  be  considered 
when  the  patient  is  in  a  state  suitable  for  operation  (e.g.  it  is  dangerous  to  perform 
an  Saphenous  vein  bypass  graft  as  a  treatment  of  angina-pec toris  on  a  patient  with 
congestive-heart- failure)-,  and  some  drugs  are  not  suitable  for  certain  types  of 
patients  (e.g.  tetracycline  cannot  be  given  to  young  children).  The  condition  could 
also  provide  a  more  detailed  description  on  the  clinical  state  of  the  patient  so  as  to 
tailor  the  treatment  to  the  case  under  consideration.  This  approach  in  which  the 
symptoms  are  treated  individually  is  known  as  symptomatic.  Note  that  such  condi¬ 
tions  may  require  further  tests  to  be  carried  out  or  observations  of  the  disease’s 
evolution. 

Finally,  as  mentioned  before,  the  condition  (• important  slot),  under  which  the 
risk  that  the  patient  might  be  subject  to  if  its  presence  is  missed  and  not  treated 
becomes  too  high  to  be  ignored,  will  also  be  taken  into  consideration  in  selecting 
treatment.  For  instance,  it  is  suggested  that  unexplained  fever  for  more  than  a 
week  or  ten  days  in  patient  with  significant  cardiac  murmur  should  form  the  basis  of 
a  presumptive  diagnosis  of  bacterial  endocarditis,  which  may  appear  in  many  dis¬ 
guises  and  is  thus  hard  to  diagnose. 
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The  use  of  IS-A  hierarchy  is  a  popular  way  of  organizing  a  knowledge  base.  It 
provides  a  means  of  expressing  the  notion  of  generalization  of  concepts,  and  very 
often  helps  to  economize  in  representation  through  the  rule  of  inheritance.  This, 
however,  has  limited  applicability  in  our  task  domain.  No  classification  scheme  that 
progressively  categorizes  a  very  general  class  of  diseases,  such  as  cardiovascular 
diseases,  into  the  specific  disorders  has  ever  been  presented.  Classifying  a  disease 
with  such  a  scheme  step  by  step  is  certainly  not  the  way  the  physician  goes  about 
making  diagnosis.  If  we  are  to  follow  closely  the  reasoning  steps  of  the  physician  so 
as  to  make  the  system  more  understandable  to  him,  we  cannot  force  him  to  con¬ 
struct  an  IS-A  tree  on  which  all  the  disease  frames  must  be  hung.  The  generaliza¬ 
tion  mechanism  that  would  have  been  used  would  hide  some  information  from  the 
user,  thus  generating  ''unpredictable"  behaviour.  Anyhow,  the  usefulness  of  the  con¬ 
cept  of  inheritance  is  very  limited  here.  In  order  to  attach  a  manifestation  to  a 
general  disease  category  it  must  be  a  possible  manifestation  of  alt  the  specializa¬ 
tions.  Associated  with  the  manifestation  in  each  specialization  are  evidential 
strengths  and,  conditions  for  its  occurrence  and  its  expected  characteristics.  This 
information  cannot  be  easily  generalized  across  the  disease  class  without  loss  of 
valuable  knowledge. 

In  our  representation  we  do  however  provide  a  means  of  incorporating  a  control 
scheme  for  the  generalization  of  disease  entities.  It  is  useful  when  different  speciali¬ 
zations  of  a  general  disease  share  a  common  set  of  manifestations  that  have  little 
diagnostic  value  for  each  individual  specific  disease,  but  provide  good  evidence  for 
the  general  category.  In  such  situations,  this  will  allow  the  system  to  hypothesize  a 
diagnosis  at  a  general  level  and  from  it  construct  a  differential  diagnosis  to  classify 
the  disease  more  specifically  for  proper  treatment.  Observing  the  design  principle 
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of  uniformity,  we  allow  a  disease  entity  to  be  a  general  class  of  diseases.  Inside  its 
frame  are  kept  the  non-specific  manifestations  and,  as  its  evidence-of  slots,  the  spe¬ 
cializations  together  with  their  differential  diagnoses  (represented  as  when- 
conditions).  The  specific  disease  entities  are  connected  to  the  general  disease 
frame  via  the  is-a  links,  for  example,  frame  group -A-streptococcal-infection  is -a 
streptococcal-infection.  Note  that  a  disease  entity  may  have  more  than  one  is-a 
relationship  and  it  is,  though  useful  for  explanatory  purposes,  not  mandatory  to 
have  a  link  between  two  entities  with  such  relationship.  Each  link  simply  indicates 
where,  in  the  general  disease  frame,  diagnostic  information  for  the  specific  disease 
can  be  found,  being  non-re strictive,  this  feature  allows  the  user  to  incrementally 
update  the  knowledge  base  more  easily.  The  general  disease  frames  can  be  intro¬ 
duced  when  the  need  for  them  becomes  apparent. 


2.4.  Use  of  the  representation 

In  this  section  we  shall  reexamine  the  frames  in  Figures  2.2-1  through  2.2-5  to 
illustrate  some  of  the  more  intricate  relations  that  are  represented  in  them.  First 
note  the  patient  manifestation  in  the  frame  of  acute  rheumatic  fever.  Patient  is 
treated  like  other  disease  entities  in  that  it  has  the  general  information  about  the 
patient  as  its  attributes.  In  fact,  to  make  use  of  the  general  mechanism  for 
requesting  attribute  values,  it  is  set  up  as  a  frame  structure.  This  allows  convenient 
representation  of  statistical  information  on  the  correlation  of  particular  diseases 
with  general  characteristics  of  the  patients,  such  as  sex  and  age  group. 

It  is  known  that  acute  rheumatic  fever  is  caused  by  group  A  streptococcal 
infection,  although  the  pathological  mechanism  involved  is  not  well-understood. 
Therefore,  strep.  A  infection  always  precedes  the  occurrence  of  ARF  and  to  accept 
the  diagnosis  of  ARF  we  should  demonstrate  the  presence  of  the  infection,  usually 
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through  lab  tests  (see  the  frame  for  strep,  infection).  The  presence  of  previous 
strep,  infection  cannot  be  easily  detected  when  ARF  has  entered  a  later  stage  which 
is  indicated  by  the  symptom  of  chorea.  With  the  use  of  a  when-condition  we  are 
able  to  capture  this  temporal  relation  between  the  two  disease  entities.  However, 
this  does  not  allow  us  to  put  such  temporal  information  as  the  exact  time  span  of  a 
disease  into  the  knowledge  base.  It  can  be  argued  that  such  data  are  usually  highly 
variable  and  carry  much  less  diagnostic  information  as  compared  to  the  "relative'’ 
temporal  relations. 

We  next  examine  the  is-a  relationships  involved  in  these  frames.  Strep,  infec¬ 
tion  can  be  classified  into  several  groups  (e.g.  A,  C,  G,  H,  etc)  which  may  lead  to 
different  complications.  ARF  is  specifically  caused  by  group  A  infection,  Bind  so  it  is 
important  to  distinguish  between  the  different  groups  represented  as  separate 
frames.  These  different  groups  share  a  few  nonsuggestive  symptoms  (e.g.  fever)  and 
non-specific  lab  tests  (e.g.  measurement  of  the  ASO  titre  and  ESR).  This  information 
is  therefore  kept  in  the  general  frame  for  strep,  infection.  In  order  to  make  use  of 
it  in  the  diagnosis  of  a  specific  group,  an  is-a  link  is  provided.  Incorporated  in  the 
strep,  infection  frame  is  also  information  for  the  differential  diagnosis  of  the  special¬ 
izations,  which  includes  the  possible  results  from  hemolytic  reaction  on  blood  agar 
(not  very  specific).  This  generalization  can  be  further  carried  over  the  class  of 
infectious  diseases  which,  while  showing  manifestations  of  different  nature,  share 
some  similar  test  procedures  such  as  observations  of  the  gram  stain,  aerobicity  and 
shape  of  the  infecting  organism.  This  information  can  similarly  be  represented  as 
attributes  of  infectious  diseases  and  placed  in  the  conditions  of  the  evidence-of 
slots.  This  suggests  an  alternative  representation  for  a  portion  of  the  information 
used  by  the  MYCIN  system.  In  addition,  physical  manifestations  that  suggest  the 
presence  of  infectious  disease  in  the  first  place  are  also  represented.  It  is  observed 
that  since  the  causal  relation  between  strep.  A  infection  and  AKF  is  rare  but  is 
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usually  used  as  a  necessary  condition  for  the  confirmation  of  ARF  it  is  not  impor¬ 
tant  to  distinguish  between  the  different  groups  as  far  as  the  diagnosis  of  ARF  is 
concerned.  We  can  therefore  directly  associate  it  with  strep,  infection  as  it  is  done 
in  most  texts.  Note  that  although  erythema  marginatum  is  a  specific  kind  of  rash 
(erythema)  and  its  presence  can  be  directly  requested  from  the  user  without  first 
considering  the  more  general  disease  entity,  in  the  frame  for  erythema  there  is  the 
following  evidence-of  slot: 

erythema-marginatum  definitely  suggested 

when  (erythema. appearance  =  "pale  centres,  serpiginous  margins”) 

so  that  when  the  user  indicates  the  presence  of  rash  in  the  patient,  he  will  be 
requested  to  characterize  its  appearance  in  order  not  to  miss  the  diagnosis  of 
erythema  marginatum. 

Carditis,  which  may  involve  the  different  layers  of  the  heart  muscules  (i.e.  peri¬ 
cardium,  myocardium  and  endocardium),  is  sometimes  present  in  patient  with  ARF. 
It  can  be  considered  as  a  general  class  of  diseases  that  include  pericarditis,  myocar¬ 
ditis  and  endocarditis.  However,  what  we  are  interested  in  here  is  just  the  indica¬ 
tion  of  cardiac  involvement  as  an  evidence  of  the  presence  of  ARF.  The  individual 
types  of  carditis  themselves  are  not  suggestive  of  ARF,  but  are  considered  a  major 
manifestation  when  it  is  suspected.  This  agrees  with  our  representation:  being  a 
specific  condition  which  is  satisfied  when  any  kind  of  carditis  is  present,  it  strongly 
suggests  ARF  when  the  condition  is  tested  and  turns  out  to  be  true.  This  frame  is 
however  not  activated  by  the  carditis  frames  alone  (without  having  the  ARF  frame 
activated)  —  there  are  no  evidence-of  links  between  them.  To  indicate  this  close 
relation  with  the  ARF  frame,  it  is  named  rheumatic  carditis  (a  medical  term).  Note 
that  when  rheumatic  carditis  is  present,  the  specific  carditis  frame(s)  will  be 
activated  so  that  treatment,  independent  of  ARF,  can  be  suggested. 
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Finally,  we  consider  the  Jones  criterion  which  is  widely  used  as  a  guideline  for 
the  diagnosis  of  ARF.  The  manifestations  of  ARF  are  classified  as  either  major  or 
minor.  Its  diagnosis  is  confirmed  when  there  are  two  major  findings  or  one  major 
finding  with  two  minor  ones.  This  seems  to  agree  with  our  discretely  classifying  evi¬ 
dences  into  five  levels  and  basing  the  confirmation  of  a  disease  on  the  aggregate  evi¬ 
dential  strengths.  This,  of  course,  is  not  the  general  procedure  used  by  the  physi¬ 
cian  to  make  diagnosis;  and  we  do  make  use  of  other  associational  information  and 
an  attention-focusing  mechanism  to  reach  clinical  decisions.  These  will  be  ela¬ 
borated  in  the  next  chapter. 


2.5.  Remarks 

In  this  representation,  diseases,  syndromes,  signs,  symptoms  and  lab  tests  are 
treated  uniformly  —  each  of  them  can  be  considered  as  a  node  in  a  PART-OF  hierar¬ 
chy  and  the  hypothesis  each  represents  is  based  on  the  evidence  provided  by  its 
parts.  Treating  findings  as  hypotheses  allows  us  to  associate  an  uncertainty  meas¬ 
ure  with  them,  in  order  to  handle  fuzziness  in  the  data  and  provide  a  smooth  inter¬ 
face  for  the  low-level  subsystems  (section  3.9).  On  the  other  hand,  high-level 
hypotheses  can  be  treated  as  findings  and  can  therefore  be  evoked  directly  by  the 
user  (so  that  the  user  can  interact  more  efficiently  with  the  system)  or  by  a  subsys¬ 
tem  that  is  capable  of  generating  such  hypotheses.  This  representation  also  sup¬ 
plies  a  clear  and  convenient  way  of  expressing  modular  chunks  of  knowledge.  Each 
frame  independently  represents  the  knowledge  we  need  for  a  disease  entity.  In 
common  with  production  systems,  one  frame  never  directly  calls  another  as 
hypothesis  activation  will  be  handled  by  the  control  mechanism.  This  is  a  significant 
advantage  in  integrating  a  new  frame  into  the  system  —  it  can  simply  be  added  to 
the  knowledge  base  and  no  other  frames  need  be  changed  to  insure  that  it  is  called 
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(c/.  the  addition  of  new  procedure  to  a  typical  program).  Following  this  idea  the 
control  mechanism  is  so  designed  that  the  acceptance  conditions  for  each  disease 
entities  can  be  tested  independently  (section  3.2). 

Although  INTERNIST’S  developers  resisted  identifying  the  parameters  they  used 
as  probabilities,  the  frequency  parameter  is  clearly  analogous  to  the  conditional  pro¬ 
bability  P{Mj\Di )  and  the  evoking  strength  is  like  a  posterior  probability  P(Di\M *). 
If  we  were  to  take  such  a  probabilistic  interpretation,  all  the  usual  complaints  about 
the  failure  of  Bayesian  assumptions  would  be  appropriate,  especially  the  difficulties 
caused  by  the  dependency  of  strongly  related  symptoms.  Here  the  model  of  clinical 
reasoning  (see  next  chapter),  though  has  some  probabilistic  flavor,  is  not  based  as 
strongly  on  the  manipulation  of  conditioned  probabilities.  The  provision  of  the  condi¬ 
tion  feature  for  capturing  the  complexity  of  some  clinical  state  alleviates  the  prob¬ 
lem  with  dependent  symptoms,  since  they  can  be  grouped  together  and  treated  as  a 
single  disease  entity.  This  is,  in  fact,  common  in  medical  practice.  Syndromes,  "the 
aggregates  of  signs  and  symptoms  associated  with  any  morbid  process  and  constitut¬ 
ing  together  the  picture  of  the  disease",  are  defined  to  describe  specific  clinical 
states.  From  the  exercise  of  coding  medical  knowledge  with  our  representation,  the 
importance  of  having  the  abundance  of  medical  terms  and  jargons  for  describing 
different  clinical  situations  is  appreciated. 
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Chapter  3 


Model  of  Clinical  Decision  Making 


3.1.  Introduction 

In  last  chapter  we  described  the  information  that  is  encoded  in  our  knowledge 
base  and  have  shown  how  it  is  organized  as  small  packages  of  knowledge  associated 
with  individual  findings  and  diseases.  Here  we  proceed  with  a  discussion  of  a 
scheme  that  makes  use  of  this  information  to  provide  consultation  for  medical  diag¬ 
nosis.  This  chapter  examines  the  nature  of  the  "artistic"  and  unformalized  clinical 
decision  making  processes,  considers  its  relationship  to  formal  probability  theory, 
and  proposes  a  model  whereby  such  incomplete  knowledge  might  be  quantified.  The 
important  aspects  of  the  reasoning  process  are  explored  and  incorporated  in  a  uni¬ 
form  control  structure,  The  decision  criteria  employed  by  the  system  in  making  a 
diagnosis  are  stated  explicitly,  as  required  by  the  analytical  approach  to  medical 
reasoning. 

3.2.  General  approach 

In  order  that  our  system  will  be  accepted  as  a  helpful  assistant  for  making 
medical  diagnosis,  it  must  be  able  to  peform  the  job  in  a  ’'natural"  way  lhaL  can  be 
easily  understood  by  the  physician.  The  best  approach  to  the  problem  is  perhaps  to 
try  to  mimic  the  reasoning  process  of  the  physician  and  reproduce  the  steps  taken 
to  arrive  at  the  final  decision.  However,  as  discussed  before,  this  cognitive  process 
is  extremely  complex  and  is  not  fully  understood.  The  analytical  approach  taken  by 
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computer  programs  will  necessarily  require  decision  criteria  to  be  stated  explicitly 
as  numerical  equations,  but  we  know  that  physicians  rarely  deal  with  numbers  in 
making  decisions.  Therefore,  we  cannot  aim  at  a  close  correspondence  between  the 
system's  and  a  physician’s  behaviour  in  every  detail  but  only  at  a  higher  level  of 
abstraction  where  we  explain  the  reasoning  steps  in  terms  of  their  goals.  In  this 
section,  the  overall  behaviour  of  the  system  will  be  described,  in  lernis  of  its  major 
components  and  the  way  that  these  components  interact.  The  underlying  processes 
and  the  decision  criteria  will  then  be  discussed  in  detail. 

The  basic  components  of  the  system  are  schematically  shown  in  Figure  3.2-1. 
Its  diagnostic  capability  is  the  result  of  the  interaction  of  the  five  parts:  (l)  Long- 
Term  Memory,  (2)  Short-Term  Memory,  (3)  Data  Inte'rpreter,  (4)  Question  Selector, 
and  (5)  Therapy  Selector  ( cf  PIP’s  organization). 

The  long-term  or  associative  memory  [Newell72]  contains  the  rich  collection  of 
general  medical  knowledge  organized  into  the  frames  as  described  in  the  previous 
chapter.  It  is  the  static  portion  of  the  knowledge  base,  which  provides  the  possible 
disease  models  for  each  case.  The  short-term  memory  is  the  site  where  data  about 
a  particular  patient  interact  with  the  general  knowledge  in  the  long-term  memory. 
The  amount  of  information  in  the  short-term  memory  is  quite  variable,  depending  on 
the  complexity  of  the  case  and  the  number  of  active  hypotheses.  Note  however  that 
psychological  researchers  believe  that  there  is  a  limit  on  human  capacity  for  pro¬ 
cessing  information  --  about  seven  "chuncks"  [Miller56].  The  activation  of  disease 
hypotheses  (i.e.  the  transfer  of  the  corresponding  aspects  of  the  general  knowledge 
into  the  short-term  memory)  and  how  such  knowledge  is  bound  to  the  patient- 
specific  data  are  determined  by  the  supervisory  program  to  be  described  next. 

The  supervisory  program  oversees  the  operations  of  various  subprocesses  and 
guides  the  interaction  of  the  system  with  the  user.  The  principle  goal  of  this 
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supervisor  is  to:  (1)  arrive  at  a  coherent  formulation  of  the  case,  by  quickly  generat¬ 
ing  and  testing  hypotheses  and  by  excluding  competing  hypotheses,  and  (2)  to 
recommend  treatment  for  the  final  diagnosis.  It  also  carries  out  the  functions  of 
answering  questions  about  the  system  knowledge  and  acquiring  new  knowledge  from 
the  users.  These  will  be  described  in  the  next  chapter.  Tn  trying  to  come  up  with  a 
diagnosis,  the  system,  much  like  the  physician,  tries  to  develop  a  disease  model  that 
matches  well  with  the  findings  obtained  from  the  patient,  in  order  to  form  a  reason¬ 
able  basis  on  which  to  evaluate  the  clinical  problem  and  to  lay  the  ground-work  for 
subsequent  information-gathering.  It  accomplishes  this  goal  by  undertaking  two 
processes,  one  is  model-directed  and  the  other  is  data-directed.  Although  these  two 
aspects  of  the  problem-solving  process  are  interrelated,  they  are  presented 
separately  for  clarity  of  exposition. 

The  data-directed  process,  carried  out  by  the  Data  Interpreter,  is  the  accumu¬ 
lation  of  a  profile  of  data  concerning  the  patient  and  inferring  from  the  given 
findings  the  possible  diagnosis.  It  determines  when  a  hypothesis  is  to  be  activated, 
accepted  or  rejected,  basing  the  decision  on  how  well  the  hypothesized  frame  fits 
the  data,  acquired  incrementally  from  the  user.  Because  there  are  innumerable 
facts  which  could  be  gathered,  one  needs  a  sharp  focus  for  this  activity.  This  focus 
is  provided  by  the  Question  Selector.  It  makes  use  of  the  disease  model  set  up  in 
the  short-term  memory  as  guideline  and  chooses  an  appropriate  tactic  for  eliminat¬ 
ing  or  confirming  hypotheses,  depending  on  the  number  of  active  hypotheses  and 
their  relative  estimated  likelihoods.  Control  is  alternated  between  the  two 
processes,  in  accordance  with  the  popular  hypothesize-and-test  paradigm,  until  all 
important  findings  are  accounted  for  by  the  set  of  confirmed  disease  hypotheses. 
Treatment  for  each  individual  disease  is  then  recommended  by  the  Therapy  Selec¬ 
tor,  based  on  the  clinical  condition  of  the  patient. 
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Figure  3.2-1  Schematic  diagram  of  the  system  components 
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3.3.  Model  of  evidential  strength 


Researchers  in  the  field  of  computer-aided  medical  diagnosis  have  all  found  a 
need  to  quantify  inferences,  be  they  causal  or  associational,  with  numerical  weights. 
As  we  mentioned  before,  although  the  physician  very  seldom  deals  with  numbers  in 
his  reasoning,  in  his  language  we  can  find  terms  that  suggest  these  numerical 
quantifiers.  The  issue  is  then  perhaps  not  whether  numbers  should  be  used  in  our 
model  of  inference,  but  how  inferences  should  be  quantified  so  that  they  can  be 
readily  translated  back  into  the  kind  of  terms  or  ideas  that  they  are  intended  to 
convey  in  the  first  place.  A  related  issue  is  how  these  quantifiers  should  be  manipu¬ 
lated  in  order  to  make  decisions.  Can  we  use  statistical  theory  or  are  we  dealing 
with  issues  that  seem  to  go  beyond  statistics?  Again,  it  is  important  that  this  mani¬ 
pulation  process  be  easily  understood  by  the  physician  and  that  it  appeals  to  his 
common  sense. 

The  scheme  that  is  sought  is  one  with  a  sound  foundation  based  on  precisely 
defined  and  well-understood  concepts  which  are  also  intuitively  appealing,  and  not  on 
some  ad  hoc  scoring  function.  Such  a  scheme  will  allow  data  to  be  set  up,  tested 
and  adjusted  more  easily.  It  should  also  be  applied  in  a  uniform  manner  at  the 
different  levels  of  abstraction,  from  basic  findings  to  pathophysiological  states  and 
further  to  disease  categories,  so  that  the  reasoning  can  be  easily  explained  in  a  way 
much  similar  to  that  in  production  systems.  The  model  that  is  used  in  our  design  is 
based  on  that  of  the  MYCIN  system  [Shortliffe?5].  It  is,  in  effect,  an  approximation 
to  conditional  probability  and  parallels  results  available  from  using  Bayes’  Theorem. 
It  therefore  provides  ways  to  handle  decision  rules  as  discrete  packets  of  knowledge 
and  it  provides  a  quantification  scheme  that  permits  accumulation  of  evidence  in  a 
manner  that  adequately  reflects  the  reasoning  process  of  an  expert  using  the  same 
or  similar  pieces  of  information. 
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It  will  be  shown  how  the  sufficiency  factors,  necessity  factors  and  compatibility 
factors  can  be  interpreted  in  terms  of  a  single  concept,  namely  MYCIN’s  measure  of 
belief,  with  a  more  rigorous  approach  —  abandoning  the  direct  associations  with  pro¬ 
babilities,  and  differentiating  the  notions  of  the  likelihood  measure  of  a  certain  fact 
and  the  evidential  strength  of  a  finding.  The  modeling  scheme  will  be  presented 
below  following  the  way  the  problem  was  approached  and  the  motivation  behind  each 
step  will  be  discussed.  There  is  a  lucid  discussion  on  the  inadequacies  of  probability 
in  the  theory  of  confirmation  in  the  paper  cited  above. 


3.3.1.  Sufficiency  factors 

As  discussed  in  section  2.3,  the  sufficiency  factor  SF[Di,Mj]  is  a  measure  of  the 
evidential  strength  of  the  following  piece  of  knowledge: 

if  manifestation  Mj  is  present,  then  the  patient  has  disease  Di. 

This  is  to  say,  if  the  premise  is  known  to  be  true,  then  there  is  a  SF  x  100%  belief 
that  the  conclusion  is  valid.  This  is  not  quite  the  same  as  saying  that  the  probabil¬ 
ity  of  the  conclusion  being  true  is  SF  x  100%,  as  might  be  suggested  by  the  case  SF 
=  1.  Consider  the  extreme  situation  where  SF  -  0.  This  means  we  cannot  put  any 
faith  in  the  statement  at  all  (0%  belief),  and  so  the  presence  of  Mj  does  not  allow  us 
to  conclude  that  Z)4  is  absent  (P(Dt)  =  0)  and,  in  fact,  nothing  about  Di  as  far  as  this 
particular  statement  is  concerned. 

At  this  point  we  have  to  abandon  this  probabilistic  interpretation  of  SF  and. 
introduce  a  parameter  known  as  the  measure  of  belief,  denoted  by 

MB  [ H,E  ]  =  "the  measure  of  belief  in  the  hypothesis  H, 
based  on  the  evidence  E". 
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It  provides  a  scale  ,  from  0  to  1,  for  measuring  evidential  strength  of  a  piece  of 
information  subjectively.  Unity  means  there  is  definite  evidence  for  confirming  that 
the  hypothesis  is  true,  while  zero  means  there  is  none.  The  evidence  E  need  not  be 
an  observed  event,  but  may  be  a  hypothesis  (itself  subject  to  confirmation).  The 
sufficiency  factor  is  then  defined  as  the  proportionate  increase  in  this  measure  of 
belief  of  the  hypothesis  Hi  that  "the  patient  has  disease  ZV’  based  on  the  evidence 
Ej  that  "manifestation  M3-  is  present",  or  symbolically, 


SF[DitMj] 


MB [Hi,E 8cEj]  -  MB[HitE] 
1  -  MB  [H{lE  ] 


where  E  is  the  accumulated  evidence  related  to  the  hypothesis  H This  definition 
provides  us  an  update  funetion  for  the  measure  of  belief  of  a  hypothesis  in  light  of  a 
piece  of  new  confirming  evidence: 

MB[Hi,E8cEj]  =  MB\Hi,E ]  +  SFjA.ATjHl  -  MB[HitE)). 

Note  also  that  this  formula  bears  the  same  relationship  to  the  MBs  as  the  sequential 
diagnosis  form  of  Bayes’  Theorem  does  to  the  conditional  probabilities  (section 
1.4.1).  It  also  gives  the  following  properties  that  appeal  to  our  common  sense: 

(i)  SF  =  0,  i.e.  Ej  has  no  evidential  value  with  respect  to  77* 

=  >  MB  [H^  E  kEj]  =  MB[HitE  j. 

(ii)  SF  \_Dit  Mj ]  =  1,  i.e.  Ej  implies  Hi 

=  >  MB[Hi,E&cEs]  =  1. 

(iii)  MB[H,E]  remains  in  the  closed  interval  [0,  l],  and  increases  toward  1  as  confirming  evi¬ 

dence  is  found,  equalling  1  only  if  the  piece  of  evidence  logically  implies  H  with  cer¬ 
tainty,  i.e.,  "is  sufficient”  for  H. 

(iv)  Since  the  updating  function  above  is  symmetrical  with  respect  to  E  and  Ej, 

MB  [ H,E  \ScE2\  =  MB  [HSE2&E^\,  that  is,  the  order  in  which  pieces  of  evidence  are 
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discovered  does  not  affect  the  level  of  belief  in  a  hypothesis. 


3.3.2.  Necessity  factors 


We  would  like  the  necessity  factor,  NF[D^Mj]  to  give  us  an  idea  of  how  neces¬ 
sary  the  manifestation  Mj  is  for  disease  D*.  It  is  thus  a  measure  of  the  evidential 
strength  of  the  rule: 


if  manifestation  Mj  is  absent,  then  the  patient  does  not  have  disease  ZV 

We  can  again  define  the  necessity  factor  in  terms  of  the  new  parameter  as  the  pro¬ 
portionate  increase  in  the  measure  of  belief  of  the  hypothesis  //*  that  "the  patient 
does  not  have  disease  D based  on  the  evidence  Ej  that  "manifestation  Mj  is  not 

present",  that  is, 


NF\Pi,Mj] 


MB[ffi,EScEj]  -  MB[ffj,E] 

1  -  MB[H~,E] 


where  E  is  the  accumulated  evidence  related  to  the  hypothesis  H The  need  for 
separate  measures  of  belief  of  a  hypothesis  and  its  negation  is  recognized  by  the 
fact  that  MB[H,E  J  does  not  equal  1  -  MB  [H>E]  —  to  confirm  something  to  ever  so 
slight  a  degree  is  not  to  disconfirm  it  at  all,  since  the  favourable  evidence  for  some 
hypothesis  gives  no  support  whatever  to  the  contrary  supposition  in  many  cases. 
That  is,  when  MB[H,E]  >  0,  MB  [H,E  ]  =  0  and  when  MB[Ii.E ]  >  0,  MB[H,E]  =  0. 
Rearranging  the  above  formula,  we  have 

MB[H~,E8cE~]  =  MB[H~.E]  +  NF[Di,Mj]-(l  -  MB[H~tE]) 


This  measure  approaches  certainty  independently  as  partially  disconfirming  evidence 
is  acquired  and  the  order  of  the  acquisition  is  immaterial. 
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Lei  us  examine  the  relationship  between  the  sufficiency  and  necessity  factors. 
Consider  the  two  as  applied  respectively  to  the  following  statements, 

S'!  =  "if  disease  Di  is  present  then  manifestation  Mj  is  present." 
and,  S 2  -  "if  manifestation  Mj  is  absent  then  disease  Di  is  absent." 

Since  these  2  statements  have  a  contrapositive  relation,  a  100%  belief  in  one  state¬ 
ment  implies  a  definite  belief  in  the  other.  On  the  other  hand,  if  the  presence  of  Di 
does  not  alter  our  belief  in  the  presence  of  Mj,  that  is  we  believe  they  are  indepen¬ 
dent  of  each  other,  the  absence  of  Mj  will  provide  us  no  information  on  the  absence 
of  Intuitively,  since  their  measures  of  belief  agree  for  the  extreme  cases,  if  we 
assume  a  linear  relationship  between  the  two  factors  then  they  must  be  identical. 
Adopting  a  probabilistic  definition  for  MB,  we  can  easily  show  that  it  is  indeed  true 
that  NF [Di,Mj ]  =  SF[Mj,Di ]  (see  Appendix  11).  We  can  therefore  redefine  NF\_Di,Mf\ 
as  the  evidential  strength  fbr  the  statement: 

if  the  patient  have  disease  Dit  then  manifestation  Mj  is  present. 

This  measure  can,  therefore,  also  be  used  as  an  expectation  measure  of  the  pres¬ 
ence  of  a  manifestation  of  a  disease  hypothesis,  a  fact  that  is  exploited  in  the  con¬ 
trol  scheme. 


3.3.3.  Measure  of  certainty 

To  summarize  the  results  we  have  so  far,  we  have  considered  two  measures  of 
evidential  strength,  one  for  the  degree  of  confirmation  and  the  other  for  the  degree 
of  disconfirmation.  In  order  to  facilitate  comparisons  of  these  likelihood  measures 
of  competing  hypotheses,  their  two  associated  measures  of  belief  are  combined  into 
a  single  parameter  termed  the  measure  of  certainty,  MO  (certainty  factor  in  MYCIN), 
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according  to  the  following  definition: 


MC[H,E]  =  MB  [H,E ]  -  MB[H,E]. 


which,  as  will  be  shown,  does  have  some  intuitive  appeal. 


First  of  all,  it  is  clear  that  MC[H,E ]  lies  between  -1  and  +1  inclusive,  and  it  pro¬ 
vides  a  convenient  way  to  handle  indeterminacy  of  data, 


+  1 


MC[H,E]  = 


if  H  is  shown  to  be  certain, 

if  nothing  is  known  about  H, 

if  the  negation  of  H  is  shown  to  be  certain. 


In  cases  when  the  truth  or  falsity  of  a  piece  of  evidence  E\  is  not  known  with  cer¬ 
tainty,  for  example  when  E\  is  a  hypothesis  itself  or  the  presence  of  the  manifesta¬ 
tion  is  not  definitely  clear,  the  degree  of  belief  in  E\  as  reflected  by  the  MC  based 
upon  prior  evidence  E  is  used  to  adjust  the  measures  of  evidential  strength  as  fol¬ 
lows, 

SF'[ff,Ex]  =  SF[H,E  !}max(0,  MC[EhE]), 

NF'[H,E  i]  =  -NF[H,E  ^miniO.MClE  i.E]), 

This  clearly  satisfies  the  criterion  that  the  measures  of  belief  remain  unchanged 
when  the  truth  or  falsity  of  the  evidence  cannot  be  determined  (MC  [E  itE  ]  =  0). 

From  the  definitions,  we  immediately  see  that  MC[H,E  ]  =  -MC\H,E  ].  This  is 
intuitively  appealing  since  it  states  that  evidence  that  supports  a  hypothesis  dis¬ 
favors  the  negation  of  the  hypothesis  to  an  equal  extent.  In  Appendix  II  it  is  shown 
how  the  measures  of  belief  can  be  related  to  conditional  probabilities,  which  experts 
Eire  often  willing  to  use  to  quantify  inferences  (though  they  refuse  to  follow  the. 
assertions  to  their  logical  conclusions),  and  some  results  that  appeal  to  our  intuition 
are  derived. 
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Note  that  the  combining  function  always  causes  the  belief  measures  to  increase, 
regardless  of  the  relationship  between  new  and  prior  evidence.  However  in  the  case 
of  absolute  confirmation,  MB[H,E]  =  1,  we  would  like  MB[H,E ]  =  0  regardless  of  the 
disconfirming  evidence  E.  Similarly,  MB  [H,E]  =  1  should  imply  MB  [H, #  ]  =  0.  The 
case  where  MB[H,E]  =  MB[H,E]  —  1  is  contradictory  and  hence  MC  is  undefined. 
Combining  these  requirements  and  the  adjustment  due  to  uncertainty  in  the  evi¬ 
dence,  the  updating  functions  used  by  the  system  become: 


MB[H,E8cEj] 


0  if  MBlH.EStE^l, 

MB  [H,E  ]  +  SF[DiMJ\mvx.{QlMC\_Mf,E])'{l-MB[H,E])  o.vu. 
0  if  MB[H,E&E3]=  1, 

MB[H,E]  -  NF[Di,Mj]’min(0,MC[Mj,E]y(l-MB[H,E])  o.w. 


where  H  is  the  hypothesis  that  the  patient  has  disease  D *; 

the  new  piece  of  evidence  concerns  whether  Mj  is  present  or  absent; 
E  is  the  accumulated  evidence. 


3.3.4.  Evaluation  of  conditions 

Conditions  as  described  in  section  2.3  are  arbitrarily  complex  logical  expres¬ 
sions  used  to  describe  complex  clinical  states  or  simply  combine  closely  related 
disease  entities.  This  section  describes  how  a  measure  of  certainty  can  be  assigned 
to  reflect  the  truth  or  falsity  of  a  condition. 

Since  a  term  in  the  logical  expression  of  a  condition  can  be  the  name  of  a 
disease  entity,  indicating  the  sub-condition  that  the  disease  entity  is  present,  it  does 
not  have  a  binary  value  but  is  instead  determined  by  the  continuous  measure  of  cer¬ 
tainty  of  the  entity.  As  a  matter  of  fact,  it  might  be  desirable  to  replace  relational 
operations,  such  as  in  heavy-proteinuria;  urinanaly sis. protein  >  200  (mg  per  24 
hours),  with  functions  which  return  estimates  of  the  certainty  of  the  fact  that  the 
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relations  are  true,  since  the  criteria  they  describe  are  often  not  very  clear-cut.  We 
would  therefore  like  to  extend  the  measures  of  belief  to  combination  of  hypotheses 
with  logical  operators.  The  combining  functions  are  listed  below: 

(i)  Conjunctions  of  hypotheses: 

(a)  MB[Hx&H2,E]  =  uiin(MB  [H  y,E),  MB  [H2_ E  ]). 

(b)  MB  [H i&H & E  ]  =  max(MB  \H\,E  ],MB  [H%,E  ]). 

(ii)  Disjunctions  of  hypotheses: 

(a)  MBlHiVH&E]  =  max(MB  [H  hE  ],MB  [H  2E  ]), 

(b)  MB  [HyVH2,E  ]  =  min(MB  [H^.E \MB  [H~2,E  ]). 


The  corresponding  CF s  are  merely  calculated  from  their  definitions.  Function  (i)(a) 
indicates  that  the  measure  of  belief  in  the  conjunction  of  two  hypotheses  is  only  as 
good  as  the  belief  in  the  hypothesis  that  is  believed  less  strongly,  whereas  Function 
(i)(b)  indicates  that  the  measure  of  disbelief  in  such  a  conjunction  is  as  strong  as 
the  disbelief  in  the  most  strongly  dis  confirmed.  Function  (ii)  yields  complementary 
results  for  disjunctions  of  hypotheses.  Note  that  these  functions  are  in  accordance 
with  the  fuzzy  set  theory  [Zadeh65j.  It  is  not  hard  to  show  that  they  satisfy  the 
intuitive  criteria,  embedded  in  the  following  "truth  tables": 


MC[Hi&H2.E] 


MC[HltE ] 


■*-*  J 

+  1 

0 

*■  J 

-1 

+  1 

+1 

0 

-1 

0 

0 

0 

-1 

'1 

-1 

-1 

-1 
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+  1 

MC[Hz,E] 

0  -1 

MC\HVE ]  +1 

+  1 

+  1 

+1 

0 

+  1 

0 

0 

-1 

+  1 

0 

-1 

3.3.5.  Compatibility  factors 

Finally  in  this  section  we  consider  the  last  set  of  relations  between  disease 
hypotheses,  namely  the  "associations",  and  see  how  they  can  be  fit  into  the  scheme 
that  has  just  be  developed,  for  controlling  the  focus  of  attention  of  the  system.  The 
measure  of  certainty  of  a  disease  hypothesis  as  discussed  in  previous  sections,  is 
merely  a  "local"  estimate  of  the  degree  of  confirmation  or  disconfirmation  based 
upon  the  presence  and  absence  of  its  manifestations.  We  would  like  to  also  make 
use  of  the  compatibility  factors  of  its  associations  to  give  a  measure  of  how  well  the 
hypothesis  fits  in  with  the  other  hypotheses.  In  the  case  of  multiple  diagnoses  for  a 
patient  suffering  from  more  than  one  disease,  hypotheses  that  are  favored  by  those 
already  confirmed  should  be  given  more  attention  while  those  that  are  incompatible 
should  not. 

Since  a  compatibility  factor  is  a  measure  of  evidential  strength,  very  much 
similar  to  SF  (when  CF  >  0)  and  NF  (when  CF  <  0),  it  is  also  defined  in  terms  of 
measures  of  belief.  However,  confirmation  or  disconfirmation  of  a  hypothesis  should 
not  be  based  on  this  type  of  "model-generated"  evidence.  This  means  if  two  diseases 
often  occur  together  and  thus  have  a  high  CF,  the  confirmation  of  one  should  not 
directly  lead  to  the  confirmation  of  the  other,  but  rather  merely  focus  the  system’s 
attention  to  the  associated  disease  whose  evidence,  provided  by  its  manifestations, 
are  then  looked  for.  It  is  the  same  idea  as  not  allowing  a  manife station  to  be 
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confirmed  by  its  cause,  that  is,  the  system  is  not  allowed  to  ''generate"  data  based 
on  its  model  of  the  diseases.  Therefore,  a  separate  set  of  measures  of  belief  are 
used  to  incorporate  the  evidential  strength  of  the  associational  relations.  These  glo¬ 
bal  measures  are  computed  from  the  local  ones  that  are  described  in  the  previous 
sections;  they  are  assigned  the  same  values  before  the  consideration  of  the  associa¬ 
tions.  For  each  associational  relation  defined  by  CF[D  the  global  likelihood 

measures  of  are  updated  as  follows, 


(i)  if  CF  >  0, 

MB  '[D  i,E  &cEz] 

(ii)  if  CF  <  0, 

MB  \_B  \,E  &cE  2] 


0  if  MB'[DhEScE  2]=1, 

MB  '[D  lt  £  ] + CF  [D  ,,  D  8}  max(0,  MC  [D  z,  E  ]  )•  { 1  ■ -  MB  '[H,  E  ] )  o.w. 
0  if  MB'[DvEScE  2]=1, 

MB  '[B^,E]-CF[B  l,D2}max(0lMC[Dz,E])-  (l-MB  '[D^.E])  o.w . 


Therefore,  the  presence  of  a  disease  increases  the  measures  of  certainty  of  its  asso¬ 
ciated  diseases  and  decreases  those  of  the  incompatible  ones.  The  separation  of 
associational  evidence  from  those  used  for  confirming  disease  hypotheses  enables 
the  user  to  test  the  SF  and  NF  parameters  in  a  frame  by  considering  only  the 
relevant  manifestations  (the  local  context)  independently  of  other  frames. 


3.3.6.  Transfer  of  measures  of  belief 

Consider  the  following  relation:  the  presence  of  manifestation  M  implies  the 
patient  has  either  disease  or  disease  Dz.  If  the  patient  shows  that  manifestation 
and  we  have  now  definitely  ruled  out  the  possibility  of  D\  being  present,  logically  we 
should  be  able  to  conclude  that  the  patient  must  have  Dz,  without  acquiring  more 
evidence.  In  order  to  produce  such  inferential  behaviour,  we  allow  the  measure  of 
belief  of  a  disease  hypothesis  associated  with  one  of  its  manifestations  to  be 
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transferred  to  other  disease  hypotheses  that  have  the  same  manifestation,  when  the 
disease  is  disconfirmed.  It  is  very  easy  to  keep  track  of  the  required  information 
since  all  these  diseases  appear  in  the  evidence-of  slots  of  the  manifestation, 
together  with  their  corresponding  evidential  strengths.  Since  the  possible  causes  of 
a  manifestation  may  not  be  mutually  exclusive,  the  sum  of  the  evidential  strengths 
may  possibly  be  greater  than  one.  The  "non-overlap"  portion  of  the  measure  of 
belief  of  a  disconfirmed  disease,  the  part  corresponding  to  the  hypothesis  that  the 
disease  is  present  by  itself,  is  divided  among  other  undisconfirmed  hypotheses 
related  through  the  considered  manifestation  in  proportion  to  their  own  measures  of 
belief.  Here  we  have  assumed  the  simple  summation  rule  of  measures  of  belief, 
which  is  conceived  purely  on  intuitive  grounds  in  that  it  agrees  with  the  way  we 
manipulate  probabilistic  measures. 

Similarly,  when  a  disease  hypothesis  is  confirmed  all  measures  of  belief  associ¬ 
ated  with  its  manifestations  will  be  removed  from  any  other  related  hypotheses.  As 
a  result,  all  hypotheses  that  cannot  explain  significant  findings  other  than  those 
already  accounted  for  by  the  confirmed  disease  will  not  be  actively  considered 
unless  they  account  for  some  other  findings.  This  mechanism  for  transferring  MBs 
provides  a  simple  and  uniform  way  of  dealing  with  the  competition  among  disease 
hypotheses  to  explain  common  manifestations. 

3.3.7.  Summary 

The  reasoning  model  is  based  on  the  assumption  that  there  exists  something 
called  the  measure  of  belief,  an  entity  which  is  associated  with  each  fact  and  can  be 
quantified  on  a  scale  from  0  to  1  to  indicate  different  degrees  of  faith  in  the  given 
fact.  It  was  shown  how  the  necessity,  sufficiency  and  compatibility  factors  are 
defined  with  respect  to  this  parameter  and  it  has  been  demonstrated  that  their 
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properties  do  appeal  to  our  common  sense  about  confirmation  theory.  We  have  also 
introduced  the  distinction  betweena  local  and  global  measures  of  belief,  and  the  idea 
of  transferring  MB s.  as  a  means  of  modeling  the  competitive  and  cooperative 
behaviour  among  the  hypotheses.  A  more  general  formulation  of  separate  measures 
of  belief  and  disbelief  can  be  found  in  [Shafer?6]. 

Inspired  by  MYCIM’s  formalism,  we  have  created  a  uniform  inference  scheme,  in 
which  every  disease  entity  in  the  knowledge  base  is  viewed  as  a  node  in  a  large 
inferential  network.  In  such  a  network,  the  evidenve-of  links  form  the  connections 
among  these  entities  and  some  measures  of  likelihood  are  propagated  among  the 
nodes  so  that  the  impact  of  directly  observable  facts  may  be  reflected  on  the  diag¬ 
nostic  consequences  of  ultimate  interest.  This  approach  is  also  taken  by  Duda,  Hart 
and  Nilsson  [Duda76,  77]  in  their  inference  net  formalism.  Unlike  g  m  c  chan  ism 
used  in  MYC1N,  their  system  (PROSPECTOR)  does  not  require  separate  treatment  of 
belief  and  disbelief,  nor  does  it  require  the  attainment  of  a  given  level  of  certainty 
before  a  rule  can  be  used.  The  propagation  scheme  used  is  Bayesian  in  its  heritage 
and  thus  suffers  from  typical  distortions  which  the  Bayesian  methodology  can  intro¬ 
duce.  Pednault,  et  al,  have  also  shown  some  of  the  problems  that  this  scheme  may 
run  into  if  the  space  of  hypotheses  is  exclusive  and  exhaustive  [Pednault79].  Here 
we  retain  the  full  scoring  scheme  of  the  MYC1N  system,  which  gives  us  a  better 
model  of  uncertainties  of  hypotheses  at  the  cost  of  having  to  do  more  work  in  main¬ 
taining  the  separate  belief  measures. 

It  was  mentioned  in  the  last  chapter  that  the  different  factors  are  classified,  on 
a  discrete  scale  of  five  levels  for  the  sake  of  users’  convenience.  This  is  viable  only 
if  the  control  scheme  is  not  too  sensitive  to  perturbations  in  these  numerical 
parameters.  It  will  require  the  scheme  not  to  rely  heavily  on  such  parameters  but 
more  on  the  existence  of  the  links  themselves.  In  the  following  sections  we  shall 
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describe  how  the  use  of  simple  heuristics  and  associational  information  can  provide 
attention-focusing  capability  in  the  control  structure. 


3.4.  Input  processing 

As  mentioned  in  a  previous  section,  each  hypothesize-and-test  C3'cle  in  the  con¬ 
trol  sequence  is  started  with  the  input  of  new  data.  Here  we  shall  describe  how  the 
data  are  processed  to  update  the  disease  model,  reflected  by  the  measures  of  belief 
of  the  hypotheses. 

When  the  user  gives  a  yes-no  response  to  a  system’s  request  or  enters  an 
observed  finding,  the  measure  of  certainty  associated  with  this  piece  of  information 
is  assumed  to  be  definite,  i.e.  +1  for  a  positive  response  and  -1  for  a  negative  one, 
unless  it  is  explicitly  modified  with  an  MC  enclosed  in  parentheses.  This  allows  the 
system  to  incorporate  the  laser’s  uncertainties  readily  into  its  decision  processes. 
As  will  be  discussed  later,  it  also  allows  a  simple  interface  to  subsystems  for  data 
interpretation. 

Upon  the  indication  of  the  presence  of  a  disease  entity  all  its  attributes,  if  any, 
will  be  requested  from  the  user.  Its  measure  of  certainty  is  then  used  to  update 
those  of  its  possible  causes  through  the  evidence-of  links,  making  use  of  the  combi¬ 
nation  functions  developed  in  section  3.3.  The  evidential  strength  is  propagated  up 
the  disease  hierarchy  by  recursively  applying  the  function  for  its  ancestor  entities. 
After  the  input  has  been  completed,  the  global  evaluation,  which  can  also  be  done 
incrementally,  is  carried  out  taking  into  consideration  only  the  compatibility  among 
active  hypotheses.  Note  that  only  findings  which  are  important  will  be  entered  into 
the  list,  of  unaccounted  for  findings.  Since  part  of  the  termination  criterion  is  that 
this  list  must  be  empty,  they  must  be  accounted  for  by  at  least  one  of  the  final 
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diagnoses. 


When  all  the  data  and  findings  in  a  test  have  been  input,  the  process  of  data 
interpretation  is  performed  by  checking  all  the  conditions  associated  with  that  test 
type.  A  final  note  about  data  input:  when  the  user  enters  a  high-level  disease 
hypothesis,  as  a  means  of  directing  the  system’s  attention,  it  is  treated  as  a  piece 
of  evidence  used  in  the  evaluation  of  the  global  likelihood  measure. 


3.5.  Hypothesis  generation 

A  striking  aspect  of  the  history- taking  process  revealed  by  detailed  protocol 
analysis  [Kassirer78]  is  the  sharp  focus  of  the  clinician's  problem-solving  behavior. 
Hypotheses  are  quickly  generated  even  when  there  are  relatively  few  facts  known 
about  the  patient.  These  hjrpotheses,  which  may  be  general  or  very  specific,  provide 
a  basis  for  expectation:  they  identify  the  relevant  clinical  features  that  should  prove 
fruitful  for  further  investigation.  Another  striking  aspect  of  the  process  observed 
was  the  small  number  of  active  hypotheses  maintained  by  the  clinicians  at  any  one 
time,  owing  to  the  limited  capacity  of  human  short-term  memory.  This  section 
describes  this  process  of  hypothesis  activation  and  confirmation,  and  examines  the 
decision  criteria  involved. 

A  frame  can  exist  in  one  of  five  states:  dormant,  semi-active,  active,  accepted 
and  rejected,  depending  on  the  values  of  its  different  likelihood  measures  as  defined 
previously  (see  Figure  3.4-1).  Initially,  the  short-term  memory  contains  no  frames: 
all  frames  are  in  the  long-term  memory  and  are  said  to  be  in  the  dormant  state. 
Ah  their  measures  of  belief  (MB)  are  initially  set  to  zero,  indicating  the  absence  of 
any  confirming  and  disconfirming  evidence.  Frames  are  activated  and  brought  into 
the  short-term  memory  through  the  evidenee-of  links  of  the  input  findings.  As  data 
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are  gathered  the  likelihood  measures  are  updated  by  the  combining  functions  given 
in  section  3.2.  A  frame  becomes  active  when  its  global  measure  of  certainty  ( MC ') 
is  greater  than  a  pre-specified  threshold  'Oactive-  An  active  hypothesis  corresponds  to 
one  the  physician  is  consciously  thinking  about.  Semi-active  hypotheses,  whose 
likelihood  measures  are  below  the  threshold,  bridge  the  gap  between  the  active  and 
inactive  ones  and  allow  us  to  represent  hypotheses  which  are  not  actively  under  con¬ 
sideration  but  which  may  be  "in  the  back  of  the  physician’s  mind".  In  order  to  be 
able  to  handle  multiple  diagnoses,  we  need  to  define  a  criterion  for  acceptance  of  a 
hypothesis.  When  the  MB,  a  measure  of  confirmation,  of  a  hypothesis  has  reached  a 
higher  threshold  ^accepted  as  confirming  evidence  is  accumulated,  the  hypothesis  is 
said  to  be  accepted.  Similarly,  a  hypothesis  is  rejected  when  the  MB  of  its  negation 
rises  above  the  threshold  'O accepted •  Upon  the  confirmation  of  a  hypothesis,  all 
findings  that  are  accounted  for  by  the  confirmed  disease  will  be  removed  from  the 
list  of  input  data,  and  all  activated  hypotheses  that  no  longer  show  manifestations  in 
this  new  list  will  then  be  returned  to  the  list  of  semi-active  hypotheses.  New 
findings  that  can  be  explained  by  an  already  confirmed  hypothesis  wrill  not  cause  the 
activation  of  any  new  hypotheses.  Note  that  if  both  the  acceptance  and  rejection 
criteria  are  satisfied,  the  status  of  the  hypothesis  is  ambiguous.  It  means  that  the 
system  has  been  presented  with  contradictory  data  (unless  the  knowledge  base  is 
not  properly  set  up)  and  therefore  the  input  should  be  re-examined. 

The  distinction  as  to  whether  a  hypothesis  is  either  active  or  semi-active  is 
important  since  it  marks  whether  the  hypothesis  will  play  a  vigorous  role  in  the 
disease  model.  It  is  only  when  a  hypothesis  becomes  active  that  its  support  is  pro¬ 
pagated  up  to  higher  levels  and  its  associational  relations  are  considered.  The  value 
of  -d active  is  chosen  as  a  compromise  between  two  conflicting  factors.  We  would  like 
to  assign  it  a  high  value  so  as  to  reduce  directly  the  number  of  hypotheses  that 
need  to  be  actively  considered  and  indirectly  the  number  of  high-level  hypotheses 
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they  may  trigger.  This  serves  as  the  first  step  to  focusing  the  attention  of  the  sys¬ 
tem.  The  lumping  together  of  findings  with  causally  consequent  diagnoses,  both  as 
manifestations,  leads  INTERNIST  to  some  difficulties.  Since  no  measures  of  likeli¬ 
hood  of  the  hypotheses  are  maintained  by  the  system  (the  acceptance  criterion  is 
based  on  how  competing  hypotheses  score  relatively  on  an  arbitrary  scale),  manifes¬ 
tations  can  only  be  discretely  classified  as  either  present,  absent,  or  unobserved. 
This  may  be  appropriate  for  findings,  but  when  imposed  on  the  evaluation  of  diag¬ 
noses,  it  ignores  the  arguably  real  support  of  a  strongly  suspected  though  not 
confirmed  causally  consequent  diagnosis  for  its  antecedent.  As  Pople  has  pointed 
out  [Pople77],  this  effect  may  prevent  INTERNIST  from  diagnosing  a  syndrome  of 
connected  hypotheses  if  no  one  of  them  is  definitely  provable  even  though  the  cir¬ 
cumstantial  evidence  of  their  combined  high  likelihood  is  convincing  to  a  physician. 
If  the  threshold  is  set  to  a  very  high  value  (  as  T?accepied),  the  difficulty  described 
above  will  arise  since  partial  support  from  highly  likely  but  unconfirmed  evidence 
cannot  be  given  to  higher  levels. 
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Figure  3.4-1  State  transitions  of  the  hypotheses 
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3.6.  Question  selection 


The  system  tries  to  establish  the  disease  hypotheses  by  requesting  relevant 
data  from  the  user.  Different  strategies,  depending  on  the  number  of  active 
hypotheses,  their  likelihood  estimates  and  the  cost  associated  with  the  acquisition  of 
the  data,  are  used  for  selecting  the  question  so  as  to  minimize  the  duration  of  the 
session.  There  are  currently  three  modes  of  investigation,  following  the  general  idea 
of  INTERNIST’S  scheme. 

When  there  is  no  prominent  hypothesis,  that  is,  one  that  has  a  substantially 
higher  measure  of  certainty  or  nearly  reaches  the  confirmed  state,  disconfirm  mode 
is  used.  Attempts  are  made  to  rule  out  the  active  hypotheses  one  by  one.  In  this 
mode  manifestations  with  a  very  high  frequency  of  occurrence  In  the  disease  being 
processed,  as  reflected  by  NF ,  are  requested.  Such  questions  stand  a  good  chance 
of  eliminating  one  or  more  of  the  considered  models.  The  cost  associated  with  the 
data  type  is  taken  into  consideration  so  that  less  expensive  items  are  asked  first. 
Because  of  the  high  cost  required  for  the  acquisition  of  laboratory  data,  disconfirm, 
mode  Is  not  used  when  the  type  of  questioning  has  reached  the  level  of  laboratory 
procedures.  In  such  cases  or  when  the  number  of  active  hypotheses  has  fallen 
within  a  pre-specified  number,  the  discriminate  mode  will  be  used.  In  this  mode  the 
top  two  diagnoses  are  selected  for  discrimination.  Items  that  count  heavily  for  one 
model  while  count  little  or  nothing  for  the  other,  that  is  those  with  great  difference 
in  th  SFs  for  the  two,  are  desiderata  for  questioning.  Finally,  if  there  is  a  prom¬ 
inent  hypothesis,  the  confirm  mode  is  used.  Questions  are  selected  that  are  thought 
to  have  a  good  chance  of  being  "clinchers''  for  the  hypothesis  with  the  highest  likeli¬ 
hood.  Manifestations  with  high  SFs  with  respect  to  the  considered  model  are  asked. 
The  system  continues  in  confirm  mode  until  either  the  model  is  confirmed  or  until 
other  hypotheses  become  competitive.  In  the  former  case,  the  system  concludes 
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that  the  considered  disease  is  present  and  the  active  list  is  updated  as  described  in 
the  last  section. 

A  simple  measure  is  taken  to  avoid  constant  shifting  of  the  system’s  attention 
as  more  hypotheses  are  triggered  by  new  input  findings,  which  may  cause  the  user 
to  become  lost.  The  specification  of  a  strategjr,  such  as  confirm  Dy,  is  considered 
the  current  goal  of  the  system,  which  will  not  be  changed  unless  it  is  consistently 
contradicted  by  the  new  findings,  that  is,  the  evidence  is  building  up  on  the  wrong 
side.  This  "retention  of  attention"  mechanism  also  allows  a  more  coherent  explana¬ 
tion  of  the  reasoning  steps  of  the  system. 

3.7.  Termination  criteria 

Programs  which  deal  with  relatively  broad  domains,  such  as  PIP  and  INTERNIST, 
have  inadequate  criteria  for  deciding  when  a  diagnosis  is  complete.  They  are  unable 
to  judge  when  the  major  diagnostic  problems  have  been  resolved  and  only  the  "loose 
ends"  remain:  the  programs  continue  exploring  less  and  less  sensible  additional 
hypotheses  until  the  user  tires  of  the  consultation.  For  example,  PIP  only  stops  if 
no  active  hypotheses  remain  or  if  every  finding  of  every  active  hypothesis  has 
already  been  explored.  Some  measures  are  taken  to  try  to  avoid  this  problem. 

First  of  all,  it  is  important  that  a  criterion  for  the  confirmation  of  a  hypothesis 
be  established.  Normally,  the  consultation  is  terminated  when  all  input  findings  are 
accounted  for  by  at  least  one  of  the  accepted  disease  hypotheses.  In  order  to  get 
around  the  problem  of  pursuing  insensible  hypotheses  to  resolve  the  loose  ends,  the 
primitive  findings  are  classified  as  either  significant  or  insignificant  with  the  impor¬ 
tant  slot.  The  former  will  not  be  entered  into  the  list  of  unaccounted  findings  and 
therefore  need  not  be  covered  by  the  final  diagnoses.  Note  that  when  there  exist 
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uncertainties  in  the  evidences  the  system  may  still  have  to  examine  all  of  the 
findings  of  the  active  hypotheses  before  it  will  decide  to  quit.  This  kind  of  situation 
does  arise  in  real-life  and  may  require  deferring  the  decision  to  see  how  the  disord¬ 
ers  evolve  over  time. 

In  a  normal  termination,  there  may  remain  some  active  hypotheses.  Depending 
on  their  important-conditions  (according  to  the  criterion  given  in  the  next  section), 
they  may  be  considered  for  treatment.  However,  for  such  disease  hypotheses 
attempts  are  first  made  to  disconfirm  them.  Strongly  disconfirmatory  evidence  is 
gathered  until  the  criterion  is  no  longer  satisfied  or  until  the  evidence  is  exhausted. 


3.0.  Treatment  selection 

After  arriving  at  a  diagnosis,  the  system  proceeds  with  the  task  of  selecting  the 
proper  treatment  for  the  diseases  that  are  likely  to  be  present  in  the  patient.. 
Unfortunately,  in  many  clinical  situations  considerable  uncertainty  exists  about  the 
presence  or  absence  of  a  given  disease  because  no  further  confirmatory  diagnostic 
studies  are  available.  The  dilemma  of  whether  or  not  to  administer  a  particular 
drug  or  perform  a  particular  operation  on  a  patient  without  an  established  diagnosis 
is  familiar  to  the  physicians.  With  this  uncertainty,  administering  a  treatment 
known  to  be  effective  for  the  disease  under  consideration  will  be  beneficial  if  the 
disease  is  actually  present,  but  may  be  costly  in  terms  of  money,  unnecessary 
inconvenience,  and  some  definite  risk  or  undesirable  side  effects.  On  the  other 
hand,  failing  to  administer  the  treatment  may  be  deleterious  if  the  disease  is 
present  but  not  if  it  is  absent.  Methods  for  dealing  with  such  complex  therapeutic 
decisions  in  an  explicit  and  logical  fashion  have  been  developed  [Pauker?5j.  They 
include  the  construction  of  a  decision  tree  that  describes  all  the  possible  courses  of 
action  available  and  the  consequences  of  each;  obtaining  a  quantitative  estimate  of 
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the  probability  and  utility  (costs/benefits)  of  each  outcome;  and  combining  these 
estimates  into  a  measure  of  the  expected  utility  for  each  course  of  action.  The 
major  difficulty  with  these  methods  is  to  come  up  with  reasonable  estimates  of  the 
benefits  and  costs  for  the  purpose  of  comparing  factors  that  are  radically  different 
in  nature,  such  as  life  expectency  versus  money.  This  is  further  complicated  by  the 
fact  that  they  may  depend  on  each  individual  patient’s  economic  bakground,  moral 
values  and  those  of  the  society,  etc.  It  is  interesting  to  note  that  it  is  also  sug¬ 
gested  the  patients  should  be  allowed  full  participation  in  assessing  the  relative  mer¬ 
its  of  the  possible  outcomes  [Pauker77]. 

Here  we  have  followed  a  similar  idea  but  we  only  consider  what  we  think  is  the 
most  important  utility  measure,  namely  the  cost  of  not  treating  a  disease  if  it  is 
present.  This  measure  is  translated  into  a  condition  under  which  the  risk  is  too 
high  to  be  ignored.  When  this  condition  of  an  active  hypothesis  is  satisfied  after  the 
final  diagnosis  is  reached,  its  treatment  will  be  considered.  The  inclusion  of  this 
feature  is  motivated  by  the  desire  to  give  special  attention  to  malignant  diseases, 
when  there  is  substantial  evidence  for  their  presence.  In  the  protocol  analysis  cited 
before,  it  is  in  fact  observed  that  some  diagnostic  hypotheses,  although  deemed 
unlikely  by  the  clinicians,  were  considered  earnestly  because  of  the  potential 
benefits  of  therapeutic  intervention.  The  default  condition  is  NIL,  that  is,  the 
disease  will  not  be  treated  unless  it  is  confirmed. 

Once  the  set  of  diseases  to  be  treated  are  decided,  the  selection  of  therapy  will 
be  considered  separately  for  each.  This  merely  involves  going  through  the  list  of 
possible  treatments,  evaluating  their  associated  conditions  and  printing  only  those 
whose  conditions  are  satisfied.  Further  requests  may  be  generated  by  the  system  in 
evaluating  the  clinical  condition  of  the  patient. 
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3.9.  Subsystem  interface 


As  mentioned  in  the  introduction,  part  of  the  goal  of  our  system  is  to  utilize 
output  from  the  interpretation  of  test  data  by  several  other  knowledge-based  sys¬ 
tems.  These  systems  deal  with  tasks  of  different  natures  and  have  their  own  control 
structures  of  high  complexity.  The  question  is  then  how  we  should  define  the  inter¬ 
face  between  these  systems  so  that  cooperation  between  them  can  be  achieved.  The 
most  natural  type  of  communication  is  perhaps  in  the  form  of  predictions  and 
hypotheses,  exchanged  in  both  directions.  That  is,  each  system  informs  the  other 
system  what  is  expected  to  be  found  according  to  its  model,  before  it  relinquishes 
the  program  control,  and  in  return  it  is  given  the  hypotheses  generated  by  the 
other  system,  which  can  be  used  to  adjust  its  own  model.  It  is  clear  that  such  two- 

way  traffic  of  information  allows  the  construction  of  a  feedback  loop  and  thus  pro¬ 

gressive  enhancement  of  the  systems’  models.  Inferences  made  by  the  systems 
must  however  be  quantified  under  the  same  or  compatible  schemes. 

As  with  user’s  input,  the  diagnosis  system  can  accommodate  fuzzy  data,  in  the 
form  of  hypotheses,  from  those  subsystems.  On  the  other  hand,  as  discussed  in  sec¬ 
tion  3.3.2,  the  necessary  factors  can  be  used  to  estimate  the  likelihood  measures  of 

findings  from  the  active  disease  models,  using  the  same  set  of  updating  functions. 

This  constitutes  the  predictions  on  the  expected  findings,  and  can  be  used  by  the 
user  as  guidance  through  the  expect  command  (section  4.2).  Such  information  is 
very  useful  in  practice.  For  example,  in  auscultation  it  is  often  much  easier  to 
detect  a  murmur  if  it  is  specifically  searched  for.  For  the  same  reason  X-ray  films 
sent  to  a  radiologist  for  consultational  advice  are  documentated  with  the  suspected 
lesions. 
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3.10.  Consistency  considerations 


Since  the  criterion  for  hypothesis  confirmation  is  based  on  the  aggregate  evi¬ 
dential  strength  of  the  manifestations,  we  must  recognize  logical  subsumption  when 
estimating  the  likelihood  measure  and  thus  avoid  counting  evidence  more  than  once. 
For  instance,  if  E  ±  implies  then  MC[H,E i&E2]  =  MC[H,Ei]  regardless  of  the  value 
of  MC[H,E2l 

The  use  of  global  measure  of  certainty,  a  heuristic  device  for  controlling  the 
system’s  focus  of  attention,  has  alleviated  the  difficulty  that  arises  from  the  more 
common  sources  of  inconsistencies.  Evidence  provided  by  the  associationai  relations 
or  directly  supplied  by  the  user  (to  shift  the  system’s  attention)  very  likely  overlaps 
with  that  provided  by  the  manifestations.  This  is  one  of  the  reasons  why  the  former 
should  not  be  entered  in  the  confirmation  criterion.  This  also  makes  sense  regard¬ 
ing  the  fact  that  these  associations  are  less  well  understood  and  involve  greater 
uncertainty.  Similarly,  expectation  measures  based  on  the  NFs  are  only  used  in  the 
global  evaluation. 

Inherent  inconsistencies  in  the  knowledge  base  may  result  from  improper 
organization  of  the  data,  such  as  when  a  cycle  is  formed  in  the  PART-OF  heirareby. 
In  some  situations  the  effects  are  not  significant  and  can  be  tolerated.  Anyhow, 
since  such  inconsistencies  are  usually  not  apparent  to  the  user,  it  is  important  that 
the  system  provides  some  checking  during  the  process  of  data  acquisition  (see  sec¬ 
tion  4.4). 
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3.11.  Sample  interaction 


This  chapter  closes  with  a  sample  consultation  session  based  on  data  obtained 
from  a  recent  clinical  case.  Since  the  knowledge  base  is  incomplete,  it  is  only 
meant  for  illustrating  the  interaction  conventions  and  explanation  capabilities.  In 
the  trace  below,  computer  output  is  in  regular  font,  user  responses  follow  an  aster¬ 
isk  or  colon  and  are  in  bold  font,  and  commentary  appears  as  [indented  italics 

inside  braces],  indicates  that  more  requests  will  be  generated  for  a  more  com¬ 

plete  KB. 

This  is  a  system  for  providing  consultational  advice  on  diagnosis  and 
therapy  of  cardiovascular  diseases  ... 

Type  ’help’  for  the  list  of  instructions.  [see  section  4.2 } 

(You  are  in  the  Question  Answering  mode) 

*  start 

Please  enter  data  of  the  patient: 

Name? 

1:  Jane  Sample 

Sex? 

2:  female 

Age? 

3:  37 


Please  enter  medication  findings:  [recent  medications] 

4: 

Please  enter  initial  findings: 

4:  fever 

What  is  its  temperature? 

5:  40  [=C] 

What  is  its  duration? 

6:  3  [days] 
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7:  arthritis 

Is  it  multiple-joint? 

8:  yes 

Is  it  migratory? 

9:  yes 
Is  it  acute? 

10:  yes 

11:  subcutaneous  nodules  (0.7)  [Its  nature  not  clearly  known] 
What  is  its  location? 

12:  hands  shoulders 

13:  status 

Active  hypotheses: 

1)  acute  rheumatic  fever  (0.91)  J likelihood  measure] 

2)  serum  sickness  (0.42) 

Semi-active  hypotheses: 

1)  gonococcal  infection 

2)  rheumatoid  arthritis 

3)  gout 


13: 

Please  enter  data  of  the  blood  test: 

[blood  test  should  be  subdivided  into  groups  of  related  lab  procedures  and 
requested  separately,  such  as  CBC  (complete  blood  count)  and  dijfusion-and-smear] 
ASO? 

13:  why 

The  presence  of  elevated  ASO  titre  in  the  blood  test  helps  to 
confirm  the  presence  of  streptococcal  infection. 

13:  expect 

Findings  expected  in  the  blood  test: 

1)  elevated  ASO  titre  (0.82) 
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2)  elevated  ESR  (0.82) 


13:  1000  [units ] 
ESR? 

14:  60  [mm/hr] 


The  following  diseases  are  confirmed: 

1)  streptococcal  infection,  causing 
2)  acute  rheumatic  fever 

*  therapy 

Treatment  for  streptococcal  infection: 

1)  procaine  penicillin  G 

dosage  --  600000  units  twice  daily 
duration  —  6  days 

Treatment  for  acute  rheumatic  fever: 

1)  general 

instruction  --  bed  rest  until  the  patient  is  afebrile,  gaining 
weight  and  the  ESR  is  normal 

2)  aspirin 

dosage  --  80  mg/kg  body  weight  daily 

goal  --  maintain  serum  salicylate  level  at  30mg/100ml 

*  what  are  the  manifestations  of  ARF  that  are  not  reported 

They  are: 

1)  rheumatic  carditis 

2)  erythema  marginatum 

3)  arthralgia 

4)  Syndenham’s  chorea 

*  stop 
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Chapter  4 


User  Interface 

4.1.  Introduction 

This  chapter  addresses  some  of  the  human  engineering  issues  in  the  design  of  a 
user  interface  for  our  system.  The  possibilities  of  implementing  an  explanation  sys¬ 
tem,  a  data  acquisition  system  and  a  graphic  interface  are  considered.  The  poten¬ 
tial  value  of  natural  language  and  graphic  interaction  techniques  as  man-machine 
communication  media  is  examined.  These  facilities  are  designed  to  provide  effective 
interaction  between  the  user  and  the  system,  allowing  quick  retrieval  of  information 
and  maintenance  of  system’s  knowledge,  with  the  hope  of  parallelling  MYCIN’s  capa¬ 
bilities  in  this  respect  and  to  demonstrate  the  attractiveness  of  the  knowledge 
representation.  Since  the  success  of  implementing  such  facilities  very  much 
depends  on  a  proper  design  of  the  data  and  control  structures,  we  pursue  their 
development  even  at  this  early  stage.  Each  of  these  subsystems  mentioned  above 
will  be  described  individually  in  the  following  sections.  We  shall  first  describe  the 
convention  for  data  input  and  steps  that  have  been  taken  to  make  it  simple. 

4.2.  Input  convention 

Since  the  convention  for  communication  between  the  system  and  the  physician 
is  an  important  factor  that  determines  the  system’s  acceptability,  a  number  of 
features  are  designed  to  simplify  the  interactive  process  that  occurs  when  the  sys¬ 
tem  requests  information  from  the  user.  Guidance  is  provided  to  help  the  user  in 
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making  the  correct  (system-recognizable)  response. 


There  are  two  possible  input  modes,  incomplete  and  complete,  that  can  be 
selected  at  the  start  of  the  session.  In  the  incomplete  mode,  signs  and  laboratory 
data  are  requested  individually  as  they  are  needed  for  making  decisions.  This  allows 
the  user  to  track  more  closely  the  steps  that  the  system  takes  in  arriving  at  the 
final  diagnosis  (c/.  MYCIN  system).  However,  it  hinders  efficient  use  of  the  available 
data.  The  complete  mode  allows  findings  to  be  input  as  groups  according  to  their 
acquisition  procedures.  For  example,  instead  of  asking  a  question  like 

Is  prolonged  PR  interval  in  the  ECG  findings  ? 

What  is  the  ASO  tiire  in  the  blood  test  ? 

the  system  will  say 

Please  enter  the  ECG  findings: 

Please  enter  the  blood  test  findings: 

In  response,  the  user  will  have  to  type  in  a  complete  list  of  the  findings  (abnormali¬ 
ties)  for  the  specified  test.  Any  findings  that  are  not  listed  will  be  considered 
absent.  This  is  why  the  distinction  between  specific  and  initial  findings  is  necessary 
--  the  former  are  always  specifically  asked  for.  In  this  case  data  can  be  processed 
and  entered  into  the  hypothesis  evaluation  as  soon  as  they  become  available.  The 
duration  of  the  session  may  therefore  be  reduced  considerably.  However,  more 
effort  is  required  on  the  part  of  the  user  to  make  the  system  recognize  his 
responses  since  the  responses  are  not  of  the  yes-no  type  as  in  the  incomplete  mode. 
An  option  is  provided  to  generate  all  the  possible  findings  of  a  test  so  that  the  user 
can  readily  check  off  the  ones  that  are  present.  This  is  however  ineffective  when 
only  a  few  have  to  be  chosen  from  a  very  long  list.  Facilities  for  input  correction 
can  be  very  helpful  here.  Spelling-correction  routines  may  be  used  to  see  if  a  sim- 


-84- 


pie  spelling  or  typographical  error  will  account  for  an  unrecognized  response. 
Another  correction  mechanism  involves  the  decomposition  of  a  disease  identifier  into 
its  constituents  so  as  to  allow  limited  reordering  and  synonym  replacement  that 
may  account  for  descrepancies  between  the  user’s  response  and  recognized  answers, 
such  as  among  elevated  ESR,  ESR  elevated :  ESR  increased,  and  increased  erythro¬ 
cyte  sedimentation  rate.  The  dictionary  required  for  this  purpose  is  built  up 
through  learning  from  examples  supplied  by  the  user  (similar  to  learning  structural 
description  from  examples  described  in  [Winston75]). 

In  addition,  the  user  is  given  a  number  of  options  that  may  be  utilized  whenever 
he  is  asked  a  question  : 

start  -  used  to  begin  a  consultation  session  for  a  new  case. 

unknown  -  (may  be  abbreviated  u  or  unk)  used  to  indicate  that  the  answer  to  the 
question  is  not  known,  usually  because  the  data  are  either  unavailable 
or  hard  to  obtain. 

?  -  used  to  request  the  specific  finding  of  a  test  that  the  system  is 

currently  interested  in. 

??  -  used  to  generate  a  list  of  all  possible  findings  of  the  requested  test  so 

that  the  findings  can  entered  as  yes- no  response. 

why  [<number>] 

-  used  to  request  an  explanation  of  the  question  identified  by  the  specified 

<number>,  the  current  one  if  not  supplied. 

expect  [<type>] 

-  used  to  request  a  list  of  findings  that  are  expected  in  the  specified  test 
<type>,  the  one  being  requested  if  not  supplied  (as  explained  in  section 
3.9). 

status  [<entity>  [assuming  <expression>]] 

-  used  to  request  the  status  of  the  specified  disease  <ent.it.y>,  assuming 
the  given  <expression>  is  definitely  true  (or  is  not  allowed).  Status 
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alone  gives  the  lists  of  hypotheses  with  the  various  possible  status. 

correct  <entity> 

-  used  to  change  the  value  of  an  input  data  item  (question  id.  can  be  used 
in  place  of  <entity>),  or  adjust  the  system  maintained  likelihood  meas 
ure  for  a  disease  hypothesis. 

<slot  type>  <entity> 

-  used  to  display  the  specified  type  of  slots  (e.g.  description,  manifesta¬ 
tions)  of  the  given  disease  <entity>. 

confirm,  disconfirm,  discriminate  |<entity>| 

-  used  to  overrule  the  investigation  mode  selected  by  the  system  (as 
described  in  section  3.6). 

-  used  to  request  the  treatment  for  diseases  that  are  already  confirmed 
or  deserve  special  consideration. 

-  used  to  enter  the  data  acquisition  mode  for  authorized  user. 

-  used  to  digress  temporarily  in  order  to  use  the  explanation  system.  A 
question  can  be  entered  on  the  same  line,  in  which  case  the  explanation 
system  is  exited  immediately  after  the  response  is  generated,  otherwise 
exited  with  an  empty  line.  (These  features  of  this  system  are  explained 
in  the  next  sections.) 

save  <filename> 

-  saves  the  status  of  the  current  consultation  in  the  file  <filename>.rec. 

stop  -  halts  the  current  consultation. 

end  -  exits  from  the  system, 

help  -  prints  this  list. 

(’[]'  indicate  optional  items,  'fj'  repeatable  items  and  Y  alternatives.) 

Here  we  just  briefly  mention  how  changes  to  previous  input  are  handled.  Such 
a  change  merely  involves  reassigning  a  new  measure  of  certainty  to  the  affected 
disease  entity,  and  propagating  the  effect  of  the  adjustment  to  higher  levels  if 


therapy 

DA 

QA 
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necessary,  just  as  inputting  a  new  piece  of  data.  No  extra  data  structures  are 
needed  to  permit  this  kind  of  backing  up  and  it  is  not  necessary  to  restart  the  con¬ 
sultation  from  the  beginning.  This  is  possible  because  the  reasoning  process  con¬ 
sists  of,  to  a  large  extent,  independent  hypothesize-and-test  cycles,  unlike  the  single 
backward-chaining  in  some  production  systems. 


4.3.  Explanation  system 

As  was  emphasized  in  the  first  chapter,  one  of  the  primary  requirements  for 
user  acceptance  of  a  consultation  program  is  an  ability  to  explain  decisions.  It  will 
also  be  very  helpful  to  the  user  to  be  allowed  to  examine  relationships  between 
disease  entities  of  interest  or  other  information  in  the  knowledge  base.  The  simpli¬ 
city  and  uniformity  of  the  control  and  data  structures  has  greatly  simplified  the 
implementation  of  such  capabilities  in  the  system.  It  is  automatically  invoked 
before  and  at  the  end  of  each  consultation  session,  and  may  be  accessed  optionally 
during  the  consultation  itself  through  the  QA  (question-answering)  command. 

Since  decisions  are  explained  only  in  response  to  queries  from  the  user,  the 
explanation  system  is  merely  a  question-answering  system.  The  ability  to  answer 
questions  obviously  requires  that  the  queries  be  understood.  In  order  to  minimize 
special  training  needed  for  use  of  the  system,  it  is  desirable  to  let  the  physician  ask 
questions  using  simple  English.  Writing  programs  to  understand  natural  language  is 
however  rather  complex  because  of  the  many  ways  that  individuals  may  choose  to 
express  themselves.  Several  powerful  techniques  have  been  developed  [Simmons70] 
[Winograd72],  they  however  suffer  from  being  either  somewhat  slow  computationally 
or  difficult  to  generalize  in  domains  other  than  those  for  which  they  were  designed. 
Since  physicians  will  quickly  reject  a  system  that  takes  a  few  minutes  to  answer  a 
question,  speed  of  response  rather  human-style  discourse  is  emphasized  here.  Yet 
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the  system  should  be  powerful  enough  to  answer  most  questions  that  a  physician 
might  want  to  ask.  Other  important  objectives  of  the  design  are  generality  and 
flexibility  for  future  expansion. 


4.3.1.  Basic  approach 

There  are  basically  two  types  of  questions  that  the  user  is  expected  to  ask  — 
those  that  require  information  from  the  knowledge  base  and  those  that  do  not.  Dur¬ 
ing  a  diagnosis  session  the  system  keeps  track  of  a  host  of  records  on  the  status 
information  that  may  be  of  interest  to  the  user.  For  instance,  associated  with  each 
hypothesis  are  different  likelihood  measures  and  there  are  lists  of  hypotheses  in  the 
different  possible  states.  Therefore,  the  following  questions  can  be  easily  answered 
by  examining  the  appropriate  records: 

are  there  any  accepted  diseases  ? 

what  are  the  hypotheses  that  are  active  or  semi-active  ? 
what  is  the  status  of  congestive  heart,  failure  ? 
is  pulmonary  edema  rejected  ? 

A  number  of  how  and  why  questions  can  be  answered  by  listing  either  the  support¬ 
ing  evidence  or  disfavoring  evidence,  e.g. 

how  do  you  conclude  that  acute  rheumatic  fever  is  present  ? 
why  do  you  reject  malignant  hypertension  ? 

In  order  to  answer  questions  about  the  purpose  of  each  test  requested  by  the  sys¬ 
tem,  a  record  of  the  type,  the  disease  hypothesis  being  investigated,  its  manifesta¬ 
tion  that  is  expected  from  the  test  and  the  mode  of  investigation  is  kept  for  each 
requested  test.  If  we  have  the  record  (disconfirm  ECG  myocarditis  prolonged-PR- 
interval),  in  response  to  the  question: 
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why  do  you  ask  for  ECG  ? 


by  filling  in  a  pre-defined  form,  the  system  will  reply: 

The  absence  of  prolonged  PR  interval  in  the  ECG  would  help  to  disconfirm  myocar¬ 
ditis. 

Besides  being  able  to  answer  questions  on  information  about  a  specific  patient 
MYC1N  can  handle  general  questions  about  the  system’s  rule-based  knowledge.  This 
serves  as  a  very  useful  debugging  and  educational  tool,  and  similar  facility  will  be 
helpful  to  our  system.  Most  of  such  questions  handled  by  MYCIN  involve  the 
retrieval  of  rules  by  matching  either  their  conclusions  or  premises  with  the  ques¬ 
tions.  The  challenge  is  therefore  the  normalization  of  sentences  for  direct  compari¬ 
sons.  Here,  a  major  portion  of  the  knowledge  of  our  system  lies  within  the  cause- 
manifestation  relations  between  disease  entities  and  their  characteristics  such  as 
type.  The  response  to  a  question  about  this  knowledge  will  involve  the  retrieval  of 
disease  entities  that  have  certain  properties  or  attributes.  Much  of  the  effort  has 
been  placed  here  to  handle  this  type  of  questions,  e.g. 

which  active  hypothesis  has  the  symptom  chorea  ? 

which  disease  is  active  and  is  a  possible  cause  of  chorea  ? 

what  are  the  names  of  the  diseases  that  show  chorea  and  are  active  ? 

are  any  causes  of  chorea  active  ? 

Note  that  all  of  the  above  are  asking  basically  the  same  question,  and  so  we  can  see 
that  the  user  is  allowed  to  express  himself  in  a  great  number  of  ways.  The  chal¬ 
lenge  here  is  then  to  be  able  to  break  up  these  sentences  into  their  semantic  com¬ 
ponents  and  normalize  them  so  as  to  reduce  the  number  of  cases  that  need  to  be 
considered. 
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The  interpretation  of  a  sentence  is  divided  up  into  4  disjoint  processes  as  shown 
in  Figure  4.3-1,  very  similar  to  the  compilation  and  execution  processes  of  a  com¬ 
puter  program.  After  the  '’first  pass”  over  the  sentence,  it  is  disintegrated  into  its 
basic  syntactic  components  and  organized  into  a  nested  format  reflecting  the  struc¬ 
ture  of  the  input  sentence.  Preliminary  normalization  based  on  recognition  of 
synonyms  is  also  performed.  It  is  only  until  the  second  phase  is  reached  will  the 
meaning  of  the  sentence  be  interpreted.  Questions  of  the  first  type  described  above 
are  handled  here  by  some  specialized  procedures.  The  others  are  compiled  into 
nested  expressions  of  set  functions,  a  meaning  representation  Language  (MRL),  each 
of  which  determines  if  certain  characteristic  or  relationship  between  disease  entities 
is  true.  The  sentence  is  further  normalized  at  this  level,  across  different  syntactic 
constituents.  The  expression  is  then  restructured  into  a  form  more  appropriate  for 
execution.  The  final  response  is  generated  by  the  collaborative  efforts  of  the  indivi¬ 
dual  functions  under  the  supervision  of  an  interpreter.  These  processes  and  the 
intermediate  data  structures  are  described  in  detail  in  the  following  sections. 

The  division  of  the  task  among  separate  processes  offers  several  advantages.  As 
shown  above  because  of  the  many  choices  of  words,  syntactic  structures  and  order 
of  these  words  and  structures  that  are  available  for  the  construction  of  a  sentence, 
there  are  numerous  possible  combinations  for  even  a  relatively  simple  sentence.  If 
each  of  these  choices  is  handled  separately,  the  number  of  combinations  needed  to 
be  considered  at  each  stage  will  be  greatly  reduced.  Modularization  of  the  system 
according  to  the  nature  of  the  subtasks  makes  its  more  understandable  and 
manageable;  these  processes  are  designed  to  be  independent  of  each  others  to  allow 
modifications  and  expansion  to  be  done  with  ease. 
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English  sentence 


> 

Syntax  analysis  (  dictionary,  ATN,  demons 
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< 

(normalized)  Syntax  structure 


attribute  expression 


> 

t 

Optimizer 

> 
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ordered  attribute  expression 


procedural  information 


attribute  handlers 


Figure  4.3-1  Interpretation  processes 
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4.3.2.  Syntax  analysis 


The  main  task  to  be  performed  in  this  phase  is  to  figure  out  the  part  of  speech 
of  each  word  and  how  they  combine  together  to  form  the  basic  syntactic  com¬ 
ponents  of  the  sentence.  A  deep-structure  is  constructed  to  record  this  information 
in  a  form  more  convenient  for  further  parsing,  by  capturing  the  regularities  of  the 
syntax.  Words  with  similar  meanings,  which  may  depend  on  their  parts  of  speech, 
are  recognized  and  replaced  by  the  "standard"  ones.  These  are  performed  with  the 
help  of  a  dictionary  and  augmented  transition  networks  (ATNs). 

The  dictionary  consists  of  all  the  words  that  may  appear  in  a  sentence  except 
the  names  of  the  disease  entities.  The  possible  parts  of  speech  of  each  word  are 
recorded  together  with  the  corresponding  equivalent  words,  e.g. 

what:  interrogative  pronoun  -»  what, 

interrogative  adjective  -*  which. 

(When  used  as  an  interrogative  pronoun,  "what"  is  taken  to  mean  "which  diseases" 
while  "which"  means  "which  of  the  above".  They  both  have  the  same  meaning  as 
interrogative  adjectives.)  A  group  of  words,  such  as  please  and  also  are  given  the 
type  noise.  They  do  not  carry  any  meaning  as  far  as  the  intended  answer  to  the 
question  is  concerned,  and  are  simply  ignored  during  the  interpretation.  This  helps 
to  simplify  the  grammar  and  make  the  parsing  process  more  efficient.  Unfor¬ 
tunately,  this  also  means  that  these  words  can  be  arbitrarily  placed  within  the  sen¬ 
tence.  Note  that  articles  are  not  igonred  since  they  provide  some  useful  cues  for 
handling  conjunctions  in  the  parsing.  In  the  input  process  words  that  are  parts  of  a 
disease  entity  name  are  grouped  together  to  form  a  single  token.  It  is  normalized 
into  its  formal  name  with  the  mechanism  mentioned  in  the  last  section  before  it  is 
passed  on  to  the  parser. 
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The  syntax  of  some  simple  English  sentence  constructs  is  represented  by  transi¬ 
tion  network  grammars  [Woods70].  They  provide  a  means  of  representing  English 
grammar  in  a  modular  graphical  fashion,  and  can  be  executed  like  a  finite  automa¬ 
ton  by  a  simple  interpreter  to  generate  deep-structure  representation  of  a  sentence. 
A  brief  overview  of  this  methodology  and  some  sample  networks  are  given  in  Appen¬ 
dix  III.  Figure  4.3-2  shows  the  deep-structure  (simplified)  of  a  sample  query  and  its 
corresponding  parse-tree. 

Note  the  limited  normalization  and  the  elimination  of  '’superfluous"  words. 
These  will  be  further  carried  out  at  a  higher  level  in  the  semantic  analysis.  It 
should  also  be  noted  that  the  syntax  enforced  by  our  parser  is  rather  loose  so  as  to 
allow  user  to  make  minor  grammatical  mistakes  that  do  not  alter  the  meaning  of 
the  sentence. 

It  is  rather  difficult  for  ATNs  to  handle  conjunctions  because  of  the  many  possi¬ 
ble  places  they  may  appear  in  a  sentence.  Instead  they  are  handled  by  a  demon,  a 
procedure  that  is  invoked  whenever  a  conjunction  is  spotted  in  the  input.  It  groups 
phrases  together  into  logical  expressions  as  shown. 

which  disease  show  S’  j  and  S2  but  not  S2  ? 

-»  ((int  .  what)  (VP  (v  .  show)  (OBJ  (n  .  (AND  S  i  S2  (NOT  S' 3)))))) 

The  grouping  must  be  done  at  the  proper  levels  between  two  phrases  of  the  same 
type.  The  ATNs  are  so  designed  that  each  low-level  network  corresponds  to  a  phrase 
type  repeatable  with  conjunctions.  The  grouping  can  therefore  be  accomplished  by 
matching  the  structures  in  the  data  stack  with  the  one  to  be  parsed,  starting  from 
the  top  until  they  match.  Adjustment  to  the  deep-structure  is  then  performed  at 
that  level.  The  words  no  and  not  are  handled  in  a  similar  fashion.  Semantic  infor¬ 
mation,  such  as  the  distinction  of  names  of  disease  entities  from  other  common 
nouns,  are  used  to  facilitate  the  handling  of  conjunctions. 
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what  is  the  name  of  the  active  hypothesis  which  has  an  ECO  sign  ? 


((int  .  what)  (VP  (v  .  be)  (OBJ  (n  .  name)  (PP  (prep  .  of)  (NP 
(adj  .  active)  (n  .  disease)  (AC  (VP  (v  .  have)  (OBJ  (adj  .  ECC- ) 
(n  .  manifestation))))))))) 


active 


disease  VP 


v 


ORJ 


have 


ECG  manifestation 


Figure  4.3-3  (b)  Parse  tree 
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4.3.3.  Semantic  analysis  and  interpretation 


At  the  start  of  the  semantic  analysis,  the  sentence  is  already  processed  into 
phrases,  organized  as  a  nested  expression  as  shown  in  the  previous  section.  Basi¬ 
cally,  the  analysis  is  also  carried  out  the  same  way  as  with  transition  networks.  The 
interpretation  of  the  input  expression  proceeds  from  the  top  level  inwards,  each 
type  of  structures  being  handled  by  a  special  procedure.  Each  of  these  procedures 
matches  the  particular  type  of  constituents  for  which  it  is  responsible  with  the 
expected  input  and  invokes  the  corresponding  routine  to  further  the  interpretation 
upon  a  match.  Questions  that  do  not  require  the  identification  of  certain  disease 
entities  will  be  answered  here  by  the  routine  which  is  eventually  passed  the  control. 

As  for  other  questions,  the  retrieval  of  disease  entities  with  a  specified  set  of 
attributes  essentially  involves  the  manipulation  of  sets  and  is  performed  in  three 
steps  (i)  setting  up  of  a  set  expression,  (ii)  reordering  the  expression,  and  (iii)  inter¬ 
preting  the  expression.  Consider  the  question, 

which  inactive  hypothesis  is  a  possible  cause  of  fever  or  has  the  symptom  dyspnea. 

It  is  deliberately  asked  in  a  clumsy  way  to  show  the  type  of  normalization  that  can 
be  done  at  this  level.  Let 

A  =  set  of  all  active  diseases, 

B  =  set  of  all  possible  causes  of  fever, 

C  =  set  of  all  possible  causes  of  dyspnea. 

The  answer  is  then  given  by  the  set 

A  n  {B  U  c) 

Some  of  these  sets,  such  as  B  and  C,  are  readily  available  from  the  knowledge  base 
or  the  status  records  while  the  membership  of  others  has  to  be  determined 
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individually.  For  example,  in  order  to  find  the  set  of  ail  diseases  that  have  an  ECG 
sign,  we  have  to  look  for  its  members  by  checking  each  disease  to  see  if  at  least  one 
of  its  manifestations  is  an  ECG  sign.  A  difficulty  immediately  arises  from  the  con¬ 
straint  imposed  by  the  size  of  the  universal  set,  the  set  of  all  disease  entities.  It  is 
too  large  to  allow  sets  to  be  efficiently  constructed  in  a  way  that  requires  examining 
the  members  of  the  universal  set  one  by  one,  and  it  should  be  considered  an  infinite 
set  for  all  practical  purposes.  In  particular,  we  cannot  enumerate  the  negation  of  a 
set  for  this  reason.  Therefore,  in  order  to  obtain  the  answer  to  the  above  question 
we  have  to  reorder  the  expression.  The  set  of  all  possible  causes  of  fever  or 
dyspnea  is  first  computed  by  taking  the  union  of  B  and  C.  Each  member  of  this 
(enumerable)  set  will  then  be  checked  to  see  if  it  belongs  to  A.  A  simple  yet  gen¬ 
eral  algorithm  is  developed  to  handle  an  arbitrarily  complicated  set  expression. 

The  translation  of  the  deep-structure  into  the  initial  expression  of  set  functions 
is  fairly  straight-forward  since  the  attributes  are  all  separated  and  grouped  into  the 
prefix  format  by  the  parser.  There  are  two  types  of  set  functions,  which  will  be 
referred  to  as  FLAG  and  SET.  A  SET  function  returns  the  explicit  set  of  disease  enti¬ 
ties  with  the  associated  attribute.  A  FLAG  function  takes  a  disease  entity  as  its 
parameter  and  determines  if  it  has  certain  attribute.  They  handle  the  enumerable 
and  inenumerable  sets  respectively.  As  can  be  seen  from  the  example  given  above, 
SET  functions  should  be  evaluated  before  the  FLAG  functions  to  ensure  the  inter¬ 
mediate  result  be  enumerable,  if  possible.  In  the  first  step  the  set  expression  is 
processed  into  the  following  form, 

(AND  (NOT  (SET  .  (ACTIVE-SET)))  (OR  (SET  .  (CAUSE-SET  fever)) 

(SET  .  (CAUSE-SET  'dyspnea)))) 

Note  that  only  the  individual  terms  (functions)  arc  labelled  with  their  corresponding 
types  and  this  labelling  will  be  carried  on  to  the  other  levels  in  the  next  step.  It 
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starts  from  the  innermost  expression  and  works  outwards  one  level  at  a  time.  At 
each  level  the  terms  are  reordered  so  that  those  labelled  SET  come  first,  and  the 
subexpression  is  then  labelled  according  to  the  following  rules: 

(i)  a  NOT  expression  is  always  labelled  FLAG, 

(ii)  an  AND  expression  is  labelled  the  same  as  the  first  term  (i.e.  an.  AND  expression 

is  labelled  SET  iff  it  has  at  least  one  SET  term), 

(iii)  an  OR  expression  is  labelled  the  same  as  the  last  term. 

The  above  expression  will  therefore  become 

(SET  .  (AND  (SET  .  (OR  (SET  .  (CAUSE-SET  'fever))  (SET  .  (CAUSE-SET  'dyspnea)))) 
(FLAG  .  (NOT  (SET  .  (ACTIVE-SET)))))) 

Redundant  operators,  such  as  those  resulted  from  the  interpretation  of  double  nega¬ 
tives,  are  also  removed  here.  Note  that  if  the  overall  expression  is  labelled  FLAG 
and  then  the  answer  cannot  be  determined  efficiently,  and  the  user  will  be  informed 
so.  Now  the  expression  is  in  a  form  that  can  be  easily  executed  by  a  simple  inter¬ 
preter.  Set  operations  will  only  be  performed  between  SET  functions,  and  individual 
members  of  a  (enumerable)  set  will  be  checked  in  other  cases. 

The  difficulty  addressed  above  is  partially  solved  in  the  LUNAR  system  by  the 
introduction  of  the  notion  of  "smart"  quantifiers  [Woods78].  In  the  grammar  infor¬ 
mation  system,  tables  of  specialized  enumeration  functions,  together  with  sufficient 
conditions  for  their  use,  were  stored  associated  with  each  basic  class  over  which 
quantification  could  range.  A  resolution  theorem  prover  was  then  used  to  verify  that 
the  restriction  of  a  given  quantification  implied  one  of  the  sufficient  conditions  for  a 
more  specialized  class  enumeration  function  (within  a  pre-specified  time  limit).  If 
so,  the  more  specialized  function  will  be  used.  Here,  we  try  lo  achieve  the  same 
effects  by  reordering  the  basic  enumeration  functions  according  to  their  "sizes". 
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enumerable  (small)  and  inenumcrablc  (large). 


4.3.4.  Anaphora 

The  system  also  has  limited  capability  of  handling  anaphoric  references. 
Instead  of  trying  to  develop  a  general  logical  form  of  our  MRL  for  resolving  such 
references  [Nash-Weber77],  we  devise  a  few  simple  heuristics  to  guess  at  the 
intended  meaning  of  such  sentences.  It  is  felt  that  there  is  no  robust  solution  to 
the  problem,  at  least  not  to  inherent  ambiguities,  and  so  it  will  be  more  helpful  to 
the  user  to  have  some  feedback  on  how  his  input  sentence  is  understood. 

For  the  purpose  of  dereferencing  pronouns  used  in  a  sentence,  the  system 
keeps  track  of  the  most  recent  disease  entity  that  is  referenced  by  the  user  and  the 
set  of  disease  entities  from  the  most  recent  answer  given  by  the  system.  The  need 
for  two  records  can  be  appreciated  from  the  following  example: 

what  are  the  possible  causes  of  myocarditis  ? 

The  causes  of  myocarditis  are: 
streptococcal  infection,  ... 

(i)  what  are  its  manifestations  ? 

The  manifestations  of  myocarditis  are: 

congestive  heart  failure,  prolonged  PR  interval,  ... 

(ii)  which  (one  of  them)  is  active  ? 

Among  the  above,  there  is: 
streptococcal  infection. 

Therefore,  the  pronouns  its  and  which  are  referring  to  different  entities.  This  is 
resolved  by  a  simple  heuristic:  previous  system’s  answer  is  always  preferred  unless 
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the  pronoun  is  singular  and  the  response  contains  more  than  one  entity. 

The  related  problem  of  handling  ellipses,  which  requires  inference  of  missing 
parts  of  a  sentence  structure  from  a  previous  sentence,  is  performed  by  saving  the 
previous  parse  tree  and  trying  systematically  to  replace  part  of  the  tree  with  the 
new  construct  to  make  up  a  complete  sentence.  Again,  semantic  information  is 
used  to  disambiguiate  different  possible  substitutions  of  missing  constituents. 


4.3.5.  Remarks 

The  interpreter  has  been  designed  strictly  according  to  the  needs  of  the  sys¬ 
tem.  Thus,  only  a  very  few  aspects  of  the  mammoth  task  of  natural  language  under¬ 
standing  have  been  considered.  However,  we  have  demonstrated  how  a  variety  of 
helpful  questions,  with  flexible  sentence  structures,  can  be  handled  by  a  simple  sys¬ 
tem.  Figures  4.3-4  (a),  (b)  show  examples  of  queries  that  the  system  is  able  to 
interpret.  An  experienced  user  who  becomes  aware  of  its  limitations  should  be  able 
to  retrieve  most  of  the  information  he  desires.  It  should  however  be  noted  that  the 
user  is  assumed  to  be  causal,  rational  and  cooperative  and  have  no  problem  in 
expressing  himself  in  simple  (reasonably)  correct  English,  so  that  detail  checking  of 
the  correctness  of  his  input  sentences  is  not  necessary.  Currently,  implementation 
of  a  more  complete  ATN,  case  frame  fitting  for  appropriate  associations  of  syntactic 
constituents,  and  proper  handling  of  anaphora  are  in  progress  [Ho-Tai8l]. 

A  feature  noticeably  missing  from  this  implementation  is  the  handling  of  univer¬ 
sal  quantifications  as  provided  in  the  LUNAR  system.  It  is  assumed  that  the  purpose 
of  each  question  asked  by  the  user  is  to  find  out  some  specific  properties  or  rela¬ 
tionships  about  certain  disease  entities,  rather  than  to  investigate  their  general  pro¬ 
perties.  Omission  of  this  feature  allows  significant  simplification  and  thus  improved. 


-99- 


efficiency  in  the  different  stages  of  the  interpretation  process.  However,  IT  grealer 
emphasis  is  to  be  placed  on  the  educational  functions  of  the  system,  it  may  be 
worthwhile  to  include  different  types  of  quantifiers  in  the  interpreter. 

Another  useful  extension  worth  considering  is  the  inclusion  of  commas  in  the 
input  sentences  to  help  to  disambiguate  the  confusion  with  the  order  of  the  conjunc¬ 
tions.  There  is  currently  very  limited  feedback  to  the  user  to  ensure  him  that  his 
question  is  correctly  understood.  It  is  probably  desirable  to  have  a  deep  structure 
that  will  capture  more  details  about  the  input  sentence,  such  as  in  Winograd’s  sys¬ 
tem,  to  allow  regeneration  of  the  sentence  from  this  structure.  However,  this  will 

•a 

also  require  a  much  larger  dictionary.  Implementation  of  affix  generation  routines 
will  help  to  reduce  the  size  of  the  dictionary. 
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wh  questions 

why  do  you  need  the  urine  test 

why  ask  for  a  blood  test  and  ECG  findings 

why  have  you  accepted  benign  systemic  hypertension 

why  is  left  ventricular  failure  rejected 

what  is  the  strongest  evidence  of  LVF 

what  is  the  status  of  the  active  hypotheses 

what  is  the  condition  proteinure a 

what  disease  might  cause  the  symptom  chest  pain 

what  is  the  estimated  likelihood  of  aortic  valve  stenosis 

what  does  ESR  stand  for 

what  is  the  blood  pressure  of  the  patient 

which  initial  sign  supports  aortic  regurgitation  most 

which  active  hypotheses  have  the  sign  dyspnea 

which  possible  cause  of  dyspnea  is  active 

which  disease  is  an  active  hypothesis  that  may  manifest  the  sign  dyspnea 

how  do  you  know  that  acute  rheumatic  fever  should  be  accepted 

how  can  you  tell  that  acute  rheumatic  fever  is  present 

how  did  you  eliminate  congestive  heart  failure 

how  should  I  confirm  mitral  stenosis  now 

how  many  active  hypotheses  that  is  of  the  intermediate  type  are  there 

Figure  4.3-4  (a)  Sample  queries 
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imperatives 

print  central  cyanosis  please 

show  me  the  treatments  of  hypertrophic  obstructive  cardiomyopathy 
please  tell  me  what  the  attributes  of  chest  pain  are 
give  me  some  information  on  malignant  hypertension 

yes-no  questions 

was  angina  pectoris  rejected 

does  cardiomyopathy  have  a  sign  of  the  ECG  type 
is  it  true  that  cardiomyopathy  has  an  ECG  sign 

are  there  any  active  hypotheses  that  are  supported  by  the  symptom  palpitation 
logical  operators 

which  disease  is  not  active  and  has  the  sign  paradoxical  S2  split  or  the  sign  narrow  S2 
split 

what  is  the  name  of  the  dormant  or  rejected  hypothesis  that  shows  paradoxical  S2  spli 
or  narrow  S2  split 

are  mitral  stenosis  and  hyperthyroidism  but  not  aortic  regurgitation  possible  causes  of 
loud  SI 

is  it  true  that  either  tricuspid  stenosis  or  atrial  septal  defect  is  present. 

pronouns/ ellipses 

what  are  its  manifestations 
its  likely  causes 
are  they  active 
which  is  accepted 

show  me  the  causes  of  hemoptysis  and  their  certainties 
what  are  rejected  and  how  do  you  conclude  that 

Figure  4.3-4  (b)  Sample  queries  (coni' d) 
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4.4.  Knowledge  acquisition 


In  this  section  we  outline  a  possible  user  interface  for  accessing  and  modifying 
the  knowledge  base  of  the  system.  The  problems  of  modifying  the  large  collection  of 
data  structures  in  the  knowledge  base,  in  particular  the  difficulties  encountered  in 
adding  new  instances  of  existing  data  types,  are  examined.  The  character  of  the 
errors  commonly  made  and  the  kinds  of  information  required  to  avoid  them  are 
explored.  It  is  briefly  shown  how  the  representation  of  such  structural  and  organiza¬ 
tional  information  about  the  knowledge  representation  can  be  readily  incorporated 
in  a  PSN  interpreter  [Kramer80]  so  as  to  make  the  process  of  data  structure 
management  both  easier  and  less  prone  to  error.  The  design  is  based  on  the  basic 
ideas  found  in  TEIRESIAS  [Davis77b,  77c,  77d],  a  program  used  to  facilitate  the 
interactive  transfer  of  expertise  from  a  human  expert  to  the  knowledge  base  of  a 
rule-based  system.  In  the  following  discussion  we  assume  the  reader  has  some  expo¬ 
sition  to  the  concepts  in  PSN. 


4.4.1.  The  problem 

The  difficulty  of  making  even  relatively  simple  changes  to  any  sizable  data  base 
is  a  well-known  phenomenon.  In  order  to  add  a  new  instance  of  a  data  type,  the 
programmer  immediately  faces  the  major  task  of  gathering  a  wide  range  of  informa¬ 
tion,  including  the  structure  of  the  data  type  and  its  interrelations  with  other  types 
in  the  system,  which  is  often  scattered  through  various  sources.  In  such  situation 
there  are  twTo  types  of  common  errors,  which  result  in  inconsistency  and  incom¬ 
pleteness  of  the  knowledge  base.  Although  disease  frames  can  be  entered  indepen¬ 
dently  as  units  of  information,  these  data  structures  do  cross-reference  each  others. 
The  addition  of  a  new  instance  may  require  significant  effort  to  maintain  the  existing 
interdependencies  and  the  necessary  bookkeeping  information.  For  instance. 
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because  of  Ihe  inherent  problems  of  naming  the  disease  entities  consistently,  unin¬ 
tentional  mismatches  can  easily  occur  when  a  new  disease  frame  is  added  without 
supervision  from  the  system.  On  the  other  hand,  in  order  that  the  knowledge  base 
will  always  remain  complete,  when  a  reference  to  a  new  instance  of  certain  data 
type  is  made,  the  user  should  be  informed  of  the  situation  and  be  guided  in  supply¬ 
ing  the  required  information  for  the  definition  of  the  new  data  structure.  This  is 
particular  important  for  users  that  are  noL  willing  to  learn  the  details  of  the  system. 

Another  fundamental  source  of  difficulty  here  is  size.  The  different  data  types 
have  their  own  structural  organization,  their  own  set  of  interrelations  with  other 
data  types,  and  their  own  set  of  requirements  for  integration  into  the  system.  Since 
modifications  to  a  data  type  design  (for  example,  in  order  to  accommodate  a  new 
type  of  medical  information)  have  to  be  carried  out  on  all  of  its  instances,  the 
efficient  retrieval  and  processing  of  this  set  is  another  problem  involving  the  han¬ 
dling  of  large  numbers  of  structures. 

As  a  step  further,  we  would  like  the  knowledge  acquisition  system  not  only  to 
provide  guidance  for  the  user  to  modify  the  knowledge  base  but  also  to  help  track¬ 
ing  down  the  problem  and  making  the  necessary  changes  when  erroneous  behaviour 
is  discovered.  With  the  simple  control  scheme,  it  is  often  possible  to  narrow  down 
the  cause  of  the  trouble  to  a  small  number  of  frames.  Finally,  there  should  be  some 
degree  of  consistency  check  on  the  input  information,  such  as  dependence  of  evi¬ 
dence. 


4.4.2.  Basic  approach 

The  system  is  provided  with  a  store  of  knowledge  about  its  own  representations 
so  that  the  acquisition  of  new  concepts  can  be  carried  out  in  a  high-level  dialog 
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which  help  to  transfer  information  efficiently.  The  necessary  knowledge,  which 
includes  information  about  both  the  frame  structures  and  their  organization,  is 
specified  in  a  frame  itself,  a  meta-frame  in  PSN  (referred  simply  as  frame 
hereafter).  The  interpreter  for  the  PSN  language  will  require  augmentation  so  that 
the  system  can  make  use  of  its  new  knowledge  to  keep  track  of  the  details  of  data 
structure  construction  and  maintenance,  and  to  provide  the  users  guidance  in  carry¬ 
ing  out  these  tasks.  For  instance,  the  to-add  program  will  not  only  initiate  some 
kind  of  storage  operation  that  will  indicate  if  the  given  object  is  an  instance  of  its 
associated  class,  but  will  also  actively  prompt  the  user  for  each  piece  of  data. 

There  are  several  levels  of  organization  of  the  knowledge  about  the  underlying 
representations.  At  the  highest  levels,  a  frame  hierarchy  links  the  frames  together, 
indicating  what  categories  of  data  structures  exist  in  the  system  and  the  relation¬ 
ships  between  them.  At  the  next  level  of  organization,  there  are  the  individual 
frames,  the  basic  unit  around  which  knowledge  about  representations  is  organized. 
Each  frame  indicates  the  structure  and  interrelationships  of  a  single  type  of  data 
structure.  At  lowest  level  are  the  primitive  data  structures  from  which  the  frames 
are  built;  these  offer  knowledge  about  specific  conventions  at  the  programming 
language  level. 

The  frames  are  organized  into  a  PART-OF/IS-A  hierarchy,  indicating  the  global 
organization  of  representations  in  the  knowledge  base  and  offering  a  convenient 
overview  of  them.  Part  of  the  hierarchy  for  the  current  system,  represented  by  an 
AND/OR  tree,  is  shown  in  Figure  4.4-1.  The  need  for  the  PART-OF  relationships  in 
the  hierarchy  is  motivated  by  our  desire  to  incorporate  knowledge  about  the  organi¬ 
zational  structure  of  the  primitive  concepts  (e.g.  frame  or  production  rule)  in  it. 
The  notion  of  inheritance  of  properties  is  extensively  used  in  the  network.  Acquisi¬ 
tion  of  a  new  instance  of  one  of  the  primitives  is  in  part  a  process  of  descent 
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through  this  hierarchy,  thus  providing  a  useful  structuring  of  the  dialog.  Acquisition 
of  new  types  of  conceptual  primitives  is  viewed  as  a  process  of  adding  new  branches 
to  this  network. 


structure 


Figure  4.4-1  Frame  hierarchy 


The  data  structure  is  progressively  broken  up  as  the  heirarchy  is  descended. 
For  instance,  the  meta-slot  manifestation-slot  will  consist  of  5  slots  for  the  <siot- 
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name>,  <disease-entity>,  <quantifier>,  where  and  when  conditions.  The  decomposi¬ 
tion  proceeds  until  primitive  frames,  such  as  date,  text  and  condition,  at  the  bottom 
of  the  hierarchy  are  reached.  These  primitive  frames  arc  responsible  for  requesting 
the  specific  data  types  from  the  user. 

In  each  individual  frame,  several  different  types  of  information  can  be  found: 

•  bookkeeping  information  ( author ,  data,  description ). 

•  inter-frame  organization  information  ( is-a ), 

■  the  structure  of  its  instances  (slots  and  metaslots), 

•  interrelationships  with  other  data  structures 

(the  to-add  program,  which  can  perform  actions  with  side  effects  which  are 
characteristic  of  intances  of  the  class,  e.g.  each  disease  entity  is  entered  in 
a  list  associated  with  its  test  type). 

•  a  pointer  to  all  current  instances  (the  to-fetch  program), 

Figures  4.4-2  and  4.4-3  show  the  frames  for  a  data  structure  and  a  disease  entity 
respectively.  Note  that  since  the  structure  frame  is  the  is-a  parent  of  the  entity 
frame,  information  in  the  former  need  not  be  reproduced  in  the  latter. 

The  most  detailed  knowledge  about  representations  is  found  in  a  small  set  of 
programs  associated  with  each  frame.  They  deal  with  aspects  of  the  representation 
that  are  at  the  level  of  programming  language  constructs  and  conventions.  The  to- 
add  program,  for  instance,  at  the  lowest  level  contains  information  about  how 
request  for  data  should  be  phrased  and  checking  the  of  the  input  data  type;  and  at 
the  higher  levels  contains  information  on  how  the  data  structures  collected  from  all 
the  parts  should  be  organized.  It  should  be  noted  that  these  programs  can  also  be 
represented  as  frames,  organized  in  a  manner  that  clearly  reflects  their  interrela¬ 
tionships. 
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frame  structure 


instanee-of  meta-structure  "where  ( 

description  «-  "format  for  a  top  level  data  structure"  , 
author  «-  "Steve  Ho-Tai"  , 

date  4-  "25-9-50"  ) 

with 

slots 

synonyms  :  synonyms  ; 
author  :  author  ; 
date  :  date  ; 
source  :  text  ; 
description  :  text  ; 

meta-slots 

attributes  :  attribute-slot  ; 


end 


Figure  4.4-2  Frame  for  a  data  structure 


frame  entity  is-a  structure 

instance-of  meta- structure  where  ( 

description  *-  "format  for  a  disease  entity"  , 
author  «-  "Steve  Ho-Tai”  , 

date  «-  "20-9-80"  ) 

with 

slots 

important  :  condition  : 
condition  :  condition  ; 

meta-slots 

manifestations  :  manifestation-slot  ; 
associations  :  association- slot  ; 
evidence-of  :  evidence-slot  ; 
treatment  :  treatment-slot  ; 


end 


Figure  4.4-3  Frame  for  a  disease  entity 
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Figure  4.4-4  Organization  of  frames 
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4.4.3.  Use  of  representational  knowledge 


We  next  consider  how  the  information  about  representation  can  be  used  to  allow 
the  expert  to  teach  the  system  new  instances  of  conceptual  primitives. 

From  the  information  provided  by  the  expert  the  appropriate  frame  in  the  net¬ 
work  is  selected  as  starting  point  for  the  acquisition  process.  If  there  is  no  context 
to  rely  on  for  this  selection,  the  default  is  to  start  at  the  root  of  the  frame  network. 
Since  information  is  distributed  through  the  network,  the  first  step  is  to  get  to  the 
root,  marking  the  path  while  ascending. 

The  next  step  is  to  descend  back  down  the  network  along  the  marked  path, 
using  each  frame  along  the  way  as  a  further  set  of  instructions  for  acquiring  the 
new  instance.  If  the  process  encounters  a  part  of  the  path  that  is  not  marked,  the 
expert  is  asked  to  choose  the  among  its  instances,  using  the  to- fetch  program,  one 
with  a  description  that  best  describes  the  item  being  constructed.  At  each  node  in 
the  network  the  acquisition  process  is  directed  by  a  simple  interpreter  whose  struc¬ 
ture  consists  of  three  basic  operations: 

■  use  the  structure  description  part  of  the  frame  to  guide  the  addition  of  new  com¬ 
ponents  to  the  instance, 

•  attend  to  any  updating  according  to  the  information  specified  in  to-add, 

•  add  the  new  item  to  the  frame’s  list  of  instance.?  associated  with  f.n-fetch. 

At  an  AND-node  all  branches  will  be  taken  separately  and  data  structures  con¬ 
structed  by  these  parts  are  passed  up  to  the  parent  where  they  are  aggregated. 
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4.4.4.  System’s  guidance  for  debugging 


As  mentioned  before  we  would  like  the  system  to  help  the  user  to  track  down 
the  source  of  erroneous  information  in  the  knowledge  base.  We  shall  illustrate  this 

capability  with  a  hand-simulated  example,  in  which  we  try  to  reproduce  the  kind  of 
system  behaviour  of  TE1RES1AS. 

(you  are  in  the  Question  Answering  mode) 

*  start 

Please  enter  data  of  the  patient: 

Name? 

1 :  Jack  Smith 

Sex? 

2:  male 


Please  enter  initial  findings: 

5:  cough 

6:  hemoptysis 

7:  dyspnea 

Which  one(s)  of  the  following  describes  its  setting: 

1)  on  exertion 

2)  lying  fiat 

3)  during  sleep 
8:  1  2 

9:  pulsus  alter  nans 

Please  enter  ascultation  findings: 

14:  pansystolic  murmur 
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[The  system  continues  to  gather  data.  from,  the  user  until,  a  diagnosis  is  reached] 


The  following  diseases  are  confirmed: 
l)  congestive  heart  failure,  causing 
2)  pulmonary  edema 

*  DA 

[Request  to  enter  the  data  acquisition  mode] 

Are  all  these  correct? 

*  no 

Are  there  any  diagnoses  that  should  not  be  made? 

*  yes 

Which  one(s)? 

*  2 

Confirming  findings  of  pulmonary  edema  are: 

1)  strongly  (0.8)  suggested  by  hemoptysis 

2)  weakly  (0.2)  suggested  by  dyspnea 


Which  one(s)  should  have  a  weaker  evidential  strength? 

*  none 

Absent  but  expected  findings  of  pulmonary  edema  are: 

1)  sometimes  (0.3)  shows  elevated  systemic  blood  pressure 

2)  sometimes  (0.3)  shows  tachycardia 

Which  one(s)  should  have  a  stranger  evidential  strength? 

*  1 


What  is  the  new  evidential  strength  for  systemic  blood  pressure? 

*  often 

Are  you  satisfied  with  pulmonary  edema? 

*  y 

Are  there  any  missing  diagnoses? 

*  what  hypotheses  are  active 
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[User  may  ask  questions  to  find  out  the  status  of  the  hypotheses  or  general 
information  in  the  knowledge  base] 

They  are; 

1)  respiratory  tract  infection 

2)  pulmonary  embolism 

*  pulmonary  embolism 

Confirming  findings  of  pulmonary  embolism  are: 

1)  weakly  (0.3)  suggested  by  hemoptysis 


Which  one(s)  should  have  a  stronger  evidential  strength? 

*  n 

Are  there  any  findings  missing  from  pulmonary  embolism? 

*  y 

Which  one(s)? 

*  multiple  defects 

Is  this  a  new  disease  entity  (if  not,  please  rephrase  it)? 

*  y 

Now  please  tell  me  a  few  things  about  this  new  disease  entity  ... 

[The  system  retrieves  the  frame  for  a.  disease  entity  a.nd,  read  it  to  find 
out  what  it  needs  to  know  about  any  disease  entity] 

Please  give  its  full,  formal  name. 

*  multiple  defects  in  lung  scan 

Now  please  give  all  its  synonyms  or  abbreviations. 

*  MDLS 

* 

Please  give  its  type. 

*  lung  scan 

Please  give  a  short  description  of  it. 
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4.4.5.  Remarks 


As  seen  from  the  examples,  the  frames  for  representing  data  structures  have  a 
straightforward  syntax  and  hence  can  themselves  be  specified  by  0.  frame,  a  meta- 
meta-frame  called  meta-structure.  The  frame  framework  can  therefore  be  pushed 
back  a  level  to  represent  knowledge  about  knowledge  about  representations.  We  can 
then  use  the  identical  frame  interpretation  process  described  above  to  acquire 
knowledge  about  a  new  representation  structure  from  the  expert.  This  then  forms  a 
small  kernel  of  knowledge  from  which  everything  else  can  be  built. 

Describing  representations  at  different  levels  of  detail  offers  a  framework  for 
organizing  and  keeping  track  of  the  required  information.  It  provides  a  useful 
degree  of  flexibility  in  the  system,  because  the  multiple  levels  of  description  insulate 
changes  at  one  level  from  the  other  levels.  For  example,  modification  to  information 
associated  with  the  to-programs  can  change  the  implementation  of  a  representation 
without  impacting  its  logical  structure  (cf.  data  independence  in  data  base  manage¬ 
ment  systems).  To  a  large  extent,  the  system  is  able  to  assure  knowledge  base 
integrity.  It  can  assure  a  form  of  completeness  by  making  sure  that  the  expert  is 
reminded  to  supply  every  necessary  component  of  a  structure  and  all  other 
appropriate  structures  will  be  informed  about  the  newly  added  item.  There  is  also 
complete  type  checking  so  that  no  interaction  with  the  expert  will  result  in  incon¬ 
sistent  data  structures  in  the  knowledge  base. 


4.5.  Graphics  interface 

This  section  briefly  describes  the  design  of  an  interactive  graphics  system  for 
setting  up  and  modifying  a  knowledge  base  of  hierarchical  data  structures,  and 
examines  its  potential  use  as  a  user  interface  for  data  maintenance  and  educational 
purposes  [Ho-TaiSQ].  Many  existing  systems  provide  extensive  facilities  for  entry 
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and  manipulation  of  text.  Text  entry,  insertion,  deletion,  search  and  display  are  all 
performed  interactively  with  a  small  number  of  key  strokes.  In  this  system  simi¬ 
larly  powerful  and  general  interactive  tools  are  implemented  for  editing  tree-like 
data  structures.  The  broad  information  bandwidth  of  graphic  display  and  useful 
graphic  interaction  technique  are  exploited  to  provide,  hopefully,  powerful  and  flexi¬ 
ble  editing  tools.  An  example  of  a  similar  interactive  program  which  provides 
eflective  human-machine  communication  is  SECS  (Simulation  and  Evaluation  of 
Chemical  Synthesis),  which  assists  a  chemist  in  heuristically  searching  and  evaluat¬ 
ing  the  space  of  good  synthetic  path  of  certain  types  of  chemical  compounds 
[Wipke78]. 


4.5.1.  The  problem 

As  described  in  chapter  2,  the  knowledge  base  consists  of  many  uniform  data 
units  linked  together  to  form  a  PART-OF  hierarchy.  The  modularity  of  the  knowledge 
base  allows  modifications  to  be  done  independpntly  and  with  great  ease.  However, 
the  underlying  structure  is  not  immediately  apparent  from  the  raw  data.  Therefore, 
the  first  goal  of  the  system  is  to  provide  facilities  for  conveying  this  structural  infor¬ 
mation  quickly  and  efficiently  through  the  use  of  graphic  displays.  Information  on 
the  inter-relationship  between  frames  that  are  very  useful  and  can  be  extracted 
easily  from  a  visual  display  include: 

•  causal  relations  of  diseases; 

•  associations  of  a  disease  with  its  manifestations  and  a  manifestation  with  its 

possible  causes; 

•  consistency  and  missing  links  in  the  data; 

•  information  for  differential  diagnosis. 

Tools  are  developed  to  help)  the  user  in  displaying  such  information  in  an  easily- 
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digestable  fashion. 


The  next  step  is  then  to  implement  tools  for  maintaining  the  knowledge  base. 
Not  only  should  the  system  allow  efficient  setup  of  the  initial  data  and  subsequent 
incremental  addition,  it  should  also  provides  tools  for  making  modifications  to  the 
existing  data.  According  to  the  nature  of  editing  involved  this  can  be  spilt  into  two 
related  subtasks,  namely  tree-editing  and  node-editing.  The  first  task  deals  with  the 
links  among  the  frames  while  the  second  concerns  with  the  contents  within  the 
frames. 

Flexibility  is  one  of  the  main  concern  kept  in  mind  throughout  the  design  in 
order  to  allow  the  accommodation  of  future  goals.  A  promising  future  goal  of  the 
system  is  to  use  it  also  as  an  output  interface  to  show  how  the  data  are  used,  that 
is  how  the  trees  are  traversed  during  a  diagnosis  session,  for  the  purposes  of 
training/debugging  the  main  system  and  possibly  education. 


4.5.2.  Basic  approach 

As  mentioned  above,  the  first  task  of  the  system  is  to  display  the  hierarchy  of 
frames  to  reveal  the  underlying  structure.  Two  approaches,  each  employing 
different  displaying  techniques,  are  taken  to  accomplish  this  goal. 

In  the  first  approach,  a  "map''  of  the  entire  knowledge  base  is  displayed  to  show 
how  a  particular  frame  fits  into  the  hierarchy.  A  difficulty  that  immediately  arises 
is  the  space  constraint,  which  is  proposed  to  be  solved  by  segmentating  the  whole 
structure  using  only  a  subset  of  the  terminal  nodes  at  a  time.  Another  related 
problem  is  the  display  of  the  node  labels  --  the  frame  identifiers  are  usually  very 
long  and  it  is  hard  to  devise  unique  and  meaningful  abbreviations  for  them.  A  sim¬ 
ple  approach  is  employed  whereby  the  names  are  replaced  by  unique  identification 
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numbers.  However,  this  conflicts  with  the  design  principle  of  minimizing  memoriza- 

* 

tion  on  the  user’s  part.  To  compensate  for  this  dictionaries  with  flexible  paging 
tools  are  provided  to  allow  looking  up  nodes  quickly  either  by  names  or  ID  numbers. 
The  map  and  dictionaries  can  be  used  interchangeably  for  referencing  purposes 
whichever  seems  more  appropriate.  Simple  heuristics  are  used  to  minimize  the 
cross-overs  in  the  map  display  and  commands  are  provided  to  move  nodes  around 
and  to  highlight  sub-maps,  so  as  to  facilitate  the  user  in  reading  the  map. 

Node  editing  is  allowed  to  be  done  in  two  different  modes,  comparable  to  the 
interline  and  intraline  editing  modes  found  in  some  text  editors.  In  the  first  mode, 
frames  are  entered  as  "batch"  keyboard  or  file  input  --  useful  for  initial  setup.  A 
simple  compiler  is  implemented  to  process  the  data.  In  the  other  mode,  selective 
insertion,  deletion  and  so  on  are  done  with  the  help  of  another  graphic  input  device, 
namely  a  tablet.  The  text  editing  techniques  are  extended  to  allow  the  user  to  navi¬ 
gate  through  the  hierarchy  and  examine  the  contents  of  individual  frames.  This  is 
carried  out  with  a  set  of  four  icons  by  first  choosing  one  of  these  icons  to  indicate 
the  direction  of  navigation  and  then  pointing  at  the  desired  entity  when  descending 
the  hierarchy. 

In  the  second  approach  for  displaying  frames,  interrelationships  between  frames 
are  the  main  information  that  are  tried  to  be  brought  out.  In  the  simple  technique 
being  used  the  sub-hierarchy  under  a  single  user-specified  disease  frame  is  first 
displayed,  other  sub-hierachies  may  then  be  superimposed  on  it  to  show  their 
interaction  as  crosslinks  between  them.  Some  commands  are  provided  to  hightiight 
different  aspects  of  the  interactions. 

Tree-editing  is  allowed  as  a  more  flexible  alternative  to  modifying  links  through 
node-editing.  A  simple  scheme  which  uses  the  inking  procedure  and  a  character 
recognizer  is  developed  to  give  a  simple  command  set  for  manipulating  links  quickly 
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and  effectively.  Commands  are  also  provided  for  a  variety  of  I/O  functions  —  read¬ 
ing  from  text  or  code  files,  intermediate  save  command,  listing  of  knowledge  base  in 
part  or  in  whole,  hardcopy  of  display  structures,  etc. 


4.5.3.  Remarks 

A  crucial  mass  of  the  system  capabilities  were  tried  out  to  test  the  viability  of 
the  major  design  concepts.  It  is  implemented  on  a  PDP  11/45  minicomputer  run¬ 
ning  the  UNIX  time-sharing  system,  with  a  "Graphic  Wonder"  as  the  display  and  a 
digitizing  tablet  as  the  sole  input  device.  Snapshots  of  the  display  during  a  session, 
using  some  artificial  test  data,  are  shown  in  Figures  4.5-1  through  4.5-5.  The  system 
has  been  designed  primarily  for  users  from  the  medical  community,  who  is  not 
expected  to  have  any  formal  training  in  computer  science  and  data  processing.  The 
major  success  of  the  system  design  is  perhaps  the  uniform  protocol,  provision  cf  an 
on-line  structured  user-manual  and  feedback  at  various  points  of  the  interaction. 
They  allow  the  user  to  interact  effectively  with  the  system  in  carrying  out  his  task. 
The  commands  from  different  modes  together  provide  a  flexible  set  of  tools  for  edit¬ 
ing.  The  user  can  choose  a  command  most  appropriate  and  convenient  for  the  task 
at  hand.  For  example,  massive  changes  are  probably  best  done  by  redefinig  the 
frame  through  the  teletype,  few  but  localized  changes  by  text-editing  and  ext  -nsive 
structural  changes  by  graph-editing.  The  next  step  to  take  is  perhaps  not  to  try  to 
pick  out  the  most  useful  commands  and  discard  the  others,  but  rather  to  put  the 
system  to  more  extensive  use  and  enhance  these  different  sets  of  editing  tools. 
From  what  may  be  considered  a  fair  success  of  this  implementation,  the  extension 
as  an  output  interface  discussed  before  may  deserve  serious  consideration. 
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Chapter  5 


Conclusion 


5.1.  Concluding  remarks 

In  this  thesis  we  have  described  the  initial  design  of  a  knowledge-based  consul¬ 
tation  system  for  medical  diagnosis  and  therapy  recommendation.  Most  of  the  goals 
set  out  at  the  beginning  have  been  at  least  partially  achieved.  The  design  has  been 
user-oriented  emphasizing  the  ease  of  interaction  and  modification.  Towards  this 
end  the  medical  knowledge  incorporated  in  our  model  of  clinical  decision  making  is 
represented  by  a  uniform  and  modular  scheme  with  the  PSN  formalism.  The 
knowledge  representation  has  the  following  characteristics: 

•  knowledge  is  organized  around  concepts,  related  to  the  clinical  status  of  the 

patient,  which  are  represented  as  frames,  linked  with  several  sets  of  rela¬ 
tions  into  an  associative  memory: 

«  the  manifestation/evidence-of  relations  organize  the  frames  into  a  PART-OF 
hierarchy,  allowing  for  varying  levels  of  abstraction  of  description; 

•  IS-A  relationships  are  provided  to  relate  general  disease  categories  with  their 

specializations; 

•  links  are  associated  with  clinical  conditions  under  which  they  can  be  applied, 

thus  allowing  the  knowledge  base  to  be  adapted  to  the  context  of  individual 
cases; 

•  knowledge  about  representation  can  be  represented  with  meta-structures, 

which  in  turn  can  be  represented  with  meta-meta-structures,  allowing  all 
information  on  data  acquisition  and  the  data  themselves  be  built  on  a  small 
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core  of  knowledge; 


Our  design  is,  to  some  extent,  a  synthesis  of  several  existing  systems  —  one 
which  hopefully  combines  their  merits.  Its  control  structure  is  based  on  an  integra¬ 
tion  of  several  paradigms  of  problem-solving  and  a  formalism  of  inexact  reasoning, 
which  models  the  clinician’s  reasoning  processes  in  many  important  respects: 

•  the  (data-directed)  process  of  hypothesis  activation  dominates  the  early  part  of 

the  diagnostic  session  as  explanation  of  the  findings  is  searched  for,  and  the 

emphasis  is  gradually  shifted  onto  hypothesis  evaluation  (goal-directed); 

•  only  a  small  number  of  active  hypotheses  are  entertained  at  any  one  time  by 

focusing  attention  effectively; 

•  hypotheses  are  evaluated  not  only  locally  on  how  well  they  explain  the  findings, 

but  also  globally  for  coherence  and  adequacy; 

•  a  variety  of  case-building  strategies  are  used  to  evaluate  and  refine  hypotheses; 

•  cost-benefit  analysis  enters  into  different  decision  making  processes,  such  as 

selection  of  lab  tests  and  therapeutic  procedures; 

•  uncertainties,  in  the  input  data  and  medical  knowledge,  are  common  and  some¬ 

thing  the  clinician  has  to  cope  with: 

Owing  to  the  severe  constraint  imposed  by  the  incompleteness  of  the  knowledge 
base,  many  interesting  aspects  of  the  control  mechanism  have  not  been  tested.  It  is 
likely  that  part  of  this  scheme  will  need  to  be  enhanced  or  revised.  However,  wc 
hope  the  reader  is  convinced  that  with  the  various  diagnostic  concepts  incorporated 
in  the  system  and  the  flexibility  of  the  design,  changes  in  the  manipulation  of  the 
belief  measures,  question  selection  strategies,  termination  or  confirmation  criteria 
to  generate  different  decision  behaviour  can  be  easily  accommodated. 
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Finally,  it  is  quite  clear  that  a  pleasant  user  environment  can  be  built  around 


this  system  design. 


5.2.  Directions  for  future  research 

Remaining  largely  untouched  is  the  problem  of  prognosis.  Little  research  has 
been  done  in  assessing  the  time  course  of  a  disorder.  Work  done  by  Fagan 
[Fagan79]  on  the  rather  constrained  problem  of  the  monitoring  of  a  ventilation 
machine  for  patients  after  surgical  operations  does  not  lend  itself  readily  to  a  more 
general  domain.  It  is  often  necessary  to  defer  a  decision  and  to  observe  the  evolu¬ 
tion  of  a  disorder  in  order  to  obtain  clues  for  making  a  diagnosis  or  selecting 
therapy  (e.g.  the  administration  of  digoxin).  To  make  use  of  such  temporal  informa¬ 
tion  we  need  to  incorporate  in  the  knowledge  base  a  stronger  notion  of  time  and  the 
severity  of  a  disorder.  We  also  need  to  provide  the  system  with  information  on  the 
effects  and  side-effects  of  each  treatment.  Although  these  are  partly  taken  into 
account  by  the  vjhen  clause  associated  with  each  manifestation,  it  is  desirable  to 
have  a  more  general  approach.  We  suggest  that  each  type  of  treatment  be 
represented  by  a  frame  and  treated  in  many  respects  as  a  disease  entity.  This 
seems  workable  especially  for  drug  therapy,  with  the  effects  in  the  manifestation 
slots,  attributes  such  as  dosage  and  interval,  and  their  default,  values.  These  medi¬ 
cation  frames  may  also  be  organized  into  an  1S-A  hierarchy  to  provide  an  apparently 
great  saving  in  representation.  For  instance,  we  may  have  sedative  activated  as  a 
treatment  and  later  make  use  of  the  information  stored  in  its  frame  to  determine 
the  specific  drug,  its  dosage,  etc.  perhaps  from  the  patient’s  age,  sex  and  clinical 
condition.  If  the  therapy  is  accepted,  then  its  effects  can  be  activated  to  provide  a 
prognostic  view  of  the  disease  development. 
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Since  the  system  is  designed  without  any  particular  domain  in  mind,  it  is 
tempting  to  try  to  extend  it  to  cover  an  area  as  wide  as  possible,  such  as  the  whole 
of  internal  medicine.  Such  an  attempt  may  require  additional  facilities  for  organiz¬ 
ing  the  disease  frames,  for  example,  to  allow  only  a  subset  of  closely  related  frames 
to  be  used  as  the  context  in  which  the  hypotheses  are  effectively  evaluated.  Coordi¬ 
nation  of  these  modules  may  resemble  that  between  a  general  practitioner  and  some 
specialists  [Chandrasekaran79]  (though  its  applicability  for  specific  diseases  is  ques¬ 
tionable). 

Another  area  which  also  deserves  further  research  is  the  interaction  between 
this  system  with  the  subsystems  for  data  interpretation.  One  may  consider  the 
structure  of  a  feedback  loop,  or  try  to  generalize  and  extend  our  data  and  control 
structures  to  these  "low-level"  processes.  In  subdomains  where  the  pathophysiologic 
mechanism  is  well-understood,  our  "systematic"  approach  to  medical  diagnosis  may 
prove  inadequate.  For  such  areas  we  might  want  to  develop  specific  reasoning 
models  which  will  interact  with  the  main  frame  as  subsystems. 

Finally,  we  note  that  the  approach  currently  taken  to  evaluate  the  performance 
of  some  of  the  more  successful  computer-based  consultation  systems  [Weiss73a] 
[Yu79],  based  on  a  crude  comparison  between  the  systems’  and  experts’  advices  on 
a  few  cases,  provides  conclusions  of  very  limited  value.  Important  aspects  of  the 
problem-solving  process  must  first  be  identified  to  form  the  basis  of  evaluation.  Sta¬ 
tistically,  to  demonstate  that  a  system  has  attained  a  certain  level  of  performance 
will  also  require  a  much  more  tightly  controlled  experiment.  Development  of  a 
rigorous  evaluation  procedure  will  allow  a  more  meaningful  comparison  between 
different  methodologies  and  help  to  find  out  their  advantages  and  disadvantages, 
providing  a  guideline  for  future  research. 
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Appendix  I:  Syntax  of  frame  structure 


The  following  shows  the  syntax  used  for  the  frame  definitions  presented  in  this 
document.  Character  strings  enclosed  in  angle  brackets  "<>"  denote  non-terminal 
symbols  while  those  in  bold  or  italics  are  language  keywords.  "The  symbols 
are  terminal  symbols.  Square  brackets  ”[}"  surround  optional  constructs  while  the 
vertical  bar  "j"  separates  alternates  in  a  production  rule.  Repeated  constructs  are 
factored  out  with  brace  brackets  "j}".  Finally,  a  +  is  attached  to  each  construct 
that  can  be  repeated  zero  or  more  time. 


<knowledge-base>  «- 
<frame>  «- 


<type>  «- 

<attribute>  «- 

<  attribute-type  >  <- 

<manifestation>  <- 

<evidence>  <- 

<association>  <- 


kb  <identifier>  :  <frame>  + 

frame  <identifier>  [,  <  identifier^4’ 

[is-a  <identifier>  [,  <identifier>]+‘] 
instance-of  entity  where  ( 


author  «- 
date  <- 
[source  <- 
type  <- 

[description  <- 
[condition  +- 
[important 
with 

[manifestations 

[associations 

[attributes 

[evidence-of 

[treatment 

end 


<text>  , 

<text>  , 
<text>  ,] 

<type>  , 
<text>  ,] 
<condition>  ,] 
<condition>]  ) 

<manifestation>+] 
<association>  ] 

< attributed] 
<evidence>+] 
<treatment>  ] 


ultimate  j  intermediate  j  others  j  initial  j  specific  j 
type  of  data  (e.g.  ECG,  auscultation,  angiocardiography) 

<slot-name>  :  <attribute-t3^pe>  [where  <expression>] 
[when  <condition>]  ; 

yes-no  j  choice  J  value 

<slot-name>  :  <identifier>  [such  that  <condition>] 
<quantifier>  present  [when  <condition>]  ; 

<slot-name>  :  <identifier>  <quantifier>  suggested 
[when  <condition>]  ; 

< slot-name >  :  <identifier>  [such  that  <condition>] 
<quantifier>  <associati on-type >  [when  <condition>]  ; 
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<association-type>  «- 
<treatment>  <- 

<  quantifier  >  «- 

<condition>  <- 

<expression>  «- 


precedes  |  follows  |  related  j  conflicts 

<siot-name>  :  <identifier>  [where  <expression>] 

[when  <condition>]  ; 

definitely  J  strongly  J  moderatetly  j  -weakly  j  insignificantly  | 
always  j  usually  j  often  |  sometimes  j  rarely 

always  |  parenthesized  logical  expression  with  operations 
no,  or  and  and  (in  order  of  precedence). 

(  <identifier>  *-  <value>  [,  <identifier>  *-  <value>]+  ) 


The  general  format  of  a  slot  therefore  consists  of  3  parts,  namely  a  slot  name, 
a  frame  type  and  an  optional  qualification.  The  slot  name  is  a  simple  identifier 
which  must  be  unique  within  the  frame.  The  frame  type  identifies  the  associated 
concept  ( choice ,  general ,  etc.  are  speciai  frames  --  keywords),  while  the 
qualification  gives  some  information  about  the  association.  A  when  clause  may 
appear  in  the  qualification  to  indicate  when  the  association  is  applicable  and,  where 
appropriate,  the  strength  of  the  association  is  represented  by  a  quantifier.  The 
attributes  of  the  related  frame  can  be  checked  or  initialized  with  a  such  that  or 
where  clause  respectively  (c/.  prerequisites  and  dependents  in  ALVEN’s  representa¬ 
tion  [Tsotsos80]). 
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Appendix  II:  A  statistical  interpretation  of  measure  of  Belief 


It  is  noted  that  experts  are  often  willing  to  state  degrees  of  belief  in  terms  of 
conditional  probabilities  but  they  refuse  to  follow  the  assertions  to  their  logical  con¬ 
clusions  (for  instance,  they  will  agree  that  although  an  evidence  may  support  a 
hypothesis  with  degree  X,  it  does  not  support  its  negation  with  degree  1  -  X).  It  is 
then  perhaps  revealing  to  note  that  the  notion  of  measure  of  belief  can  be  given  a 
probabilistic  definition  [ShortlifTe76]  which  leads  to  some  intuitively  appealing 
results. 

Definition: 


P(ff\E)  -  P(H) 

1  -  P(H) 


By  defining  MB  as  the  proportionate  increase  of  probability  with  respect  to  the  a 
priori  probability,  we  need  not  consider  the  initial  probabilities  {MB  [ H ,  <p]  =  0,  ini¬ 
tially).  By  substituting  H  for  H,  we  immediately  get 

MB  \HE 1  -  =  [\-P{H\E)-]-{X-P{H)\ 

L  •  J"  1  -P(H)  1  -  £1  -P(fT)] 

_  PUP}  zjPMaM} 

P(H) 


1)  First  of  all,  it  is  obvious  that  MB[IJ,E ]  *  1  -  MB[H,E]  as  desired.  Also  note  that 
when  the  a  priori  belief  in  a  hypothesis  is  small,  i.e.  P{H )  is  close  to  zero,  the  MC 
of  a  hypothesis  confirmed  by  an  evidence  is  approximately  equal  to  its  conditional 
probability  on  the  evidence: 


MC[H,E]  =  MB  [. H,E  ]  -  MB  [. H,E  ] 

=  P(gi-P(/)W' - 0  KPmE)- 


whereas,  MC[H,E]  =  -MC[H,E]  ra  -P{H  \E). 
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2)  Now  consider  the  relationship  between  NF  and  SF.  From  the  definitions, 


r  ,  MB  [. Hi ,  E&Ei  ]-  MB  [Hit  E  ] 

NF[Di,Mj]  = - — - =-■-■■■■  - 

:  1  -  MB[H„E] 

P(fU\JS&Ej)-^P(ffl)  _  P(m\E)  -  PiHl ) 
_  l-P(//t)  _ 1-P(H i) 

j  _  PUij\E) -P(/ft) 

l-P(Hi) 


P{,Hi\E&Ej)  -  P(Hi\E) 
l-P(Jh\E) 


P{Hi\E)  -  P{Hi\EAEj) 
P{Hi\E) 


Similarly,  interchanging  the  role  of  D,  and  M<  (Hi  ->  E,  and  E,  -*  HA, 


SF[Ui,Di] 


P(Ilf\E&Ef)  -  P(Hj\E) 

1-P(HS\E) 

P(flf\Ei:Ei)P(Hi\E)  -JPjfljlE^PfME)^ 

P(Hj\E)P(Ht\E) 

PjHjAHjlF^- P(Hj\E)P(Hi\E) 

P(Hj\E)P[H,\E) 


P(Hi\E)-  PjJU&HjjH)  -  P(Hi\E)P(Hi\E) 

P(Hj\E)P(Hi\E) 


P(Hj\E)[l  -P(Hj\E)]  -  PjH^E&E^PjHjlE) 
P(Hj\E)P(Ht\E) 

P(Hi\E)  -  P{Hi\E&Ej) 

P(Hi\E) 


Therefore,  NF{Di,Mj\  -  SFlMj.Di]  as  we  have  intuitively  concluded  in  section  3/3.2. 
This  result  allows  us  to  use  the  necessity  factors  as  expectation  measures  for  the 
manifestations  of  a  given  disease  hypothesis. 
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3)  Finally,  consider  the  Paradox  of  the  Ravens  presented  by  Carl  Hampel  [Hampel45] 
in  his  discussion  on  the  logic  of  confirmation,  which  runs  as  follows.  If  we  let  H 1  be 
the  statement  that  "all  ravens  are  black"  and  Hz  the  statement  that  "all  nonblack 
things  are  nonravens."  It  is  clear  that  H j  and  Hz  are  logically  equivalent,  and  so 
drawing  an  analogy  with  conditional  probability  we  might  be  tempted  to  assert  that 
=  MC[Hq,E]  for  all  E.  It,  however,  appears  counter-intuitive  to  state  that 
the  observation  of  a  green  vase  supports  H \  even  though  the  observation  does  seers 
to  support  H 2.  The  reason  the  paradox  occurs  is  because  we  are  convinced  that,  '  we 
are  right  in  our  intuitive  assumption  that  we  should  look  at  randomly  selected 
ravens  and  not  randomly  selected  nonblack  things  in  the  generalization  that  all 
ravens  are  black."  We  shall  see  that  this  agrees  with  the  statistical  definition  of  MC 
given  above. 

If  we  denote 

Ei  =  the  observation  of  a  raven  that  is  found  to  be  black,  and 
E 2  =  the  observation  of  a  nonblack  object  that  is  found  to  be  nonraven, 


and  assume  that  we  initially  have  no  knowledge  regarding  either  colors  of  ravens  nor 
distributions  of  colours  in  the  universe  (without  altering  the  qualitive  nature  of  the 
final  result),  we  can  quite  straightforwardly  show  that  [Shortliffe76l: 


MC\H i,E i\  -  1/m,  and 


MC[Hi,Ez] 


1 


(2 71  -  771  ) 


where  m  and  n  are  the  numbers  of  ravens  and  nonblack  objects  in  the  universe, 
respectively.  Since  n  »  m,  we  see  that  MC\H \tE  {]  »  MC  [H\,Ez\  and,  in  fact,  that 
MC[H i,Ez]  is  practically  zero,  agreeing  with  our  intuition. 
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Appendix  III:  Grammar  for  a  language  for  representing  ATNs 


For  the  sake  of  completeness,  the  features  of  ATNs  are  briefly  described  here  to 
illustrate  the  flexibility  of  this  methodology  [Woods70].  A  recursive  transitive  net¬ 
work  is  a  directed  graph  with  labelled  states  and  arcs,  a  distinguished  state  called 
the  start  state,  and  a  distinguished  set  of  states  called  final  states.  The  labels  on 
the  arcs  may  be  state  names  as  well  as  terminal  symbols.  A  terminal  symbol 
represents  the  class  of  words  that  is  expected  at  that  point  of  the  parsing.  The 
interpretation  of  an  arc  with  a  state  name  as  its  label  is  that  the  state  at  the  end  of 
the  arc  will  be  saved  on  a  stack  and  the  control  will  jump  (without  advancing  the 
input  tape)  to  the  state  that  is  the  arc  label.  When  a  fined  state  is  encountered, 
then  the  stack  may  be  popped  by  transferring  control  to  the  state  which  is  named 
on  the  top  of  the  stack  and  removing  that  entry  from  the  stack.  An  attempt  to  pop 
an  empty  stack  when  the  last  input  character  has  just  been  processed  is  the  cri¬ 
terion  for  acceptance  of  an  input  sentence.  These  transition  networks  are  aug¬ 
mented  with  (i)  structure-building  actions  associated  with  the  arcs  of  the  grammar 
network  allowing  for  the  reordering,  restructuring,  and  copying  of  constituents 
necessary  to  produce  deep-structure  representations  of  the  type  normally  obtained 
from  a  transformational  analysis,  and  (ii)  conditions  on  the  arcs  allowing  for  a 
powerful  selectivity  which  can  rule  out  meaningless  analyses  and  take  advantages  of 
semantic  information  to  guide  the  parsing.  Figure  III- 1  shows  some  samples  of  the 
ATNs  used  in  our  system. 

A  transition  network  consists  of  a  set  of  states,  each  of  which  in  turn  consists  of 
a  set  of  arcs.  The  first  element  names  the  arc  and  the  third  element  is  an  arbitrary 
test  which  must  be  satisfied  in  order  for  the  arc  to  be  followed.  The  CAT  arc  can  be 
followed  if  in  addition  the  current  input  symbol  belongs  to  the  specified  lexical 
category,  while  the  PUSH  are  causes  a  pushdown  to  the  state  indicated.  The  TST 
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arc  permits  an  arbitrary  test  to  determine  if  the  arc  is  to  be  followed.  In  these 
arcs  there  are  the  structure-building  actions  and  the  terminal  action  which  specifics 
the  state  to  which  control  is  to  be  passed.  SETR,  SENDR,  LIFTR  and  HOLD  are  used 
to  set  register  values  for  storing  information  needed  for  building  the  deep-structure. 
They  work  on  different  set  of  registers:  in  the  current  level,  lower  level,  higher  level 
and  a  ’'global”  data  storage,  respectively.  Together  with  the  VIR  arc,  the  HOLD 
action  allows  the  interpretation  of  certain  syntactic  structure  to  be  postponed.  The 
two  possible  terminal  actions,  TO  and  JUMP,  indicate  whether  the  input  pointer  to  be 
advanced  or  not.  Finally,  the  POP  arc  is  a  dummy  arc  which  indicates  the  condition 
under  which  the  state  is  to  be  considered  a  final  state,  and  the  form  to  be  returned 
as  the  value  of  the  computation.  Figure  ITT-1  shows  samples  of  the  ATNs  used  in  our 
system  (S:  is  at  the  topmost  level). 
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PUSH  VP:  geU(*JNFINTnVE) 


JUMP 


POP 


WKD  there  CAT  V 


CAT  V  getf(*.AUX)  CAT  NEG 

/  \  r  JUMP 

VP:  - >VP:AUX - >  VPrSENSE 


JUMP 


JUMP  SUBJ  =  tp 


CATV 


CKECKtVG 


CAT  PREP 
CHECK:  V-OBT-PRFIP 


Figure  III-2  Sample  of  ATNs  used  in  the  system 
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